文件名称:TestOfWebharvest05-all
- 所属分类:
- JSP源码/Java
- 资源属性:
- [Java] [源码]
- 上传时间:
- 2012-11-26
- 文件大小:
- 5.47mb
- 下载次数:
- 0次
- 提 供 者:
- 相关连接:
- 无
- 下载说明:
- 别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容均来自于网络,请自行研究使用
Web-Harvest是一个Java开源Web数据抽取工具。它能够收集指定的Web页面并从这些页面中提取有用的数据。Web-Harvest主要是运用了像XSLT,XQuery,正则表达式等这些技术来实现对text/xml的操作。测试版本。-Web-Harvest is a Java open-source Web data extraction tool. It can collect the specified Web page and extracts from these pages useful data. Web-Harvest is mainly used as XSLT, XQuery, regular expressions, such as these technologies to realize on the text/xml operation. Test version.
相关搜索: xquery
in
java
web
抽取
XQuery
正则表达式
web
harvest
crawler
x
xml
抽取
Information
Extraction
Open
Source
Web
Harvest
xml
java
in
java
web
抽取
XQuery
正则表达式
web
harvest
crawler
x
xml
抽取
Information
Extraction
Open
Source
Web
Harvest
xml
java
(系统自动生成,下载前可以参看下载内容)
下载文件列表
build.xml
config
......\config.xsd
examples
........\canon.xml
........\crawler.xml
........\expekt.xml
........\functions.xml
........\google_images.xml
........\nytimes.xml
........\xquery.xml
lib
...\bsh.jar
...\commons-codec-1.3.jar
...\commons-collections-3.1.jar
...\commons-httpclient-3.0-rc3.jar
...\commons-logging.jar
...\htmlcleaner.jar
...\log4j-1.2.13.jar
...\saxon8.jar
src
...\CommandLine.java
...\org
...\...\apache
...\...\......\commons
...\...\......\.......\httpclient
...\...\......\.......\..........\contrib
...\...\......\.......\..........\.......\ssl
...\...\......\.......\..........\.......\...\AuthSSLInitializationError.java
...\...\......\.......\..........\.......\...\AuthSSLProtocolSocketFactory.java
...\...\......\.......\..........\.......\...\AuthSSLX509TrustManager.java
...\...\......\.......\..........\.......\...\EasySSLProtocolSocketFactory.java
...\...\......\.......\..........\.......\...\EasyX509TrustManager.java
...\...\......\.......\..........\.......\...\StrictSSLProtocolSocketFactory.java
...\...\webharvest
...\...\..........\definition
...\...\..........\..........\BaseElementDef.java
...\...\..........\..........\CallDef.java
...\...\..........\..........\CallParamDef.java
...\...\..........\..........\CaseDef.java
...\...\..........\..........\ConstantDef.java
...\...\..........\..........\DefinitionResolver.java
...\...\..........\..........\EmptyDef.java
...\...\..........\..........\FileDef.java
...\...\..........\..........\FunctionDef.java
...\...\..........\..........\HtmlToXmlDef.java
...\...\..........\..........\HttpDef.java
...\...\..........\..........\HttpHeaderDef.java
...\...\..........\..........\HttpParamDef.java
...\...\..........\..........\IElementDef.java
...\...\..........\..........\IfDef.java
...\...\..........\..........\IncludeDef.java
...\...\..........\..........\LoopDef.java
...\...\..........\..........\RegexpDef.java
...\...\..........\..........\ReturnDef.java
...\...\..........\..........\ScraperConfiguration.java
...\...\..........\..........\ScriptDef.java
...\...\..........\..........\TemplateDef.java
...\...\..........\..........\TextDef.java
...\...\..........\..........\TryDef.java
...\...\..........\..........\VarDef.java
...\...\..........\..........\VarDefDef.java
...\...\..........\..........\WhileDef.java
...\...\..........\..........\XmlNode.java
...\...\..........\..........\XmlParser.java
...\...\..........\..........\XPathDef.java
...\...\..........\..........\XQueryDef.java
...\...\..........\..........\XQueryExternalParamDef.java
...\...\..........\..........\XsltDef.java
...\...\..........\exception
...\...\..........\.........\BaseException.java
...\...\..........\.........\ConfigurationException.java
...\...\..........\.........\ErrMsg.java
...\...\..........\.........\FileException.java
...\...\..........\.........\FunctionException.java
...\...\..........\.........\HttpException.java
...\...\..........\.........\ParserException.java
...\...\..........\.........\ScraperXPathException.java
...\...\..........\.........\ScraperXQueryException.java
...\...\..........\.........\ScriptException.java
...\...\..........\.........\TemplateException.java
...\...\..........\.........\TemplaterException.java
...\...\..........\.........\VariableException.java
...\...\..........\.........\XsltException.java
...\...\..........\runtime
...\...\..........\.......\html
...\...\..........\.......\....\HtmlCleanerProcessor.java
...\...\..........\.......\....\IXHtmlProcessor.java
...\...\..........\.......\processors
...\...\..........\.......\..........\BaseProcessor.java
...\...\..........\.......\..........\CallParamProcessor.java
...\...\..........\.......\..........\CallProcessor.java
...\...\..........\.......\..........\CaseProcessor.java
...\...\..........\.......\..........\ConstantProcessor.java
...\...\..........\.......\..........\EmptyProcessor.java
...\...\..........\.......\..........\FileProcessor.java
...\...\..........\.......\..........\FunctionProcessor.java
...\...\..........\.......\..........\HtmlToXmlProcessor.java
...\..
config
......\config.xsd
examples
........\canon.xml
........\crawler.xml
........\expekt.xml
........\functions.xml
........\google_images.xml
........\nytimes.xml
........\xquery.xml
lib
...\bsh.jar
...\commons-codec-1.3.jar
...\commons-collections-3.1.jar
...\commons-httpclient-3.0-rc3.jar
...\commons-logging.jar
...\htmlcleaner.jar
...\log4j-1.2.13.jar
...\saxon8.jar
src
...\CommandLine.java
...\org
...\...\apache
...\...\......\commons
...\...\......\.......\httpclient
...\...\......\.......\..........\contrib
...\...\......\.......\..........\.......\ssl
...\...\......\.......\..........\.......\...\AuthSSLInitializationError.java
...\...\......\.......\..........\.......\...\AuthSSLProtocolSocketFactory.java
...\...\......\.......\..........\.......\...\AuthSSLX509TrustManager.java
...\...\......\.......\..........\.......\...\EasySSLProtocolSocketFactory.java
...\...\......\.......\..........\.......\...\EasyX509TrustManager.java
...\...\......\.......\..........\.......\...\StrictSSLProtocolSocketFactory.java
...\...\webharvest
...\...\..........\definition
...\...\..........\..........\BaseElementDef.java
...\...\..........\..........\CallDef.java
...\...\..........\..........\CallParamDef.java
...\...\..........\..........\CaseDef.java
...\...\..........\..........\ConstantDef.java
...\...\..........\..........\DefinitionResolver.java
...\...\..........\..........\EmptyDef.java
...\...\..........\..........\FileDef.java
...\...\..........\..........\FunctionDef.java
...\...\..........\..........\HtmlToXmlDef.java
...\...\..........\..........\HttpDef.java
...\...\..........\..........\HttpHeaderDef.java
...\...\..........\..........\HttpParamDef.java
...\...\..........\..........\IElementDef.java
...\...\..........\..........\IfDef.java
...\...\..........\..........\IncludeDef.java
...\...\..........\..........\LoopDef.java
...\...\..........\..........\RegexpDef.java
...\...\..........\..........\ReturnDef.java
...\...\..........\..........\ScraperConfiguration.java
...\...\..........\..........\ScriptDef.java
...\...\..........\..........\TemplateDef.java
...\...\..........\..........\TextDef.java
...\...\..........\..........\TryDef.java
...\...\..........\..........\VarDef.java
...\...\..........\..........\VarDefDef.java
...\...\..........\..........\WhileDef.java
...\...\..........\..........\XmlNode.java
...\...\..........\..........\XmlParser.java
...\...\..........\..........\XPathDef.java
...\...\..........\..........\XQueryDef.java
...\...\..........\..........\XQueryExternalParamDef.java
...\...\..........\..........\XsltDef.java
...\...\..........\exception
...\...\..........\.........\BaseException.java
...\...\..........\.........\ConfigurationException.java
...\...\..........\.........\ErrMsg.java
...\...\..........\.........\FileException.java
...\...\..........\.........\FunctionException.java
...\...\..........\.........\HttpException.java
...\...\..........\.........\ParserException.java
...\...\..........\.........\ScraperXPathException.java
...\...\..........\.........\ScraperXQueryException.java
...\...\..........\.........\ScriptException.java
...\...\..........\.........\TemplateException.java
...\...\..........\.........\TemplaterException.java
...\...\..........\.........\VariableException.java
...\...\..........\.........\XsltException.java
...\...\..........\runtime
...\...\..........\.......\html
...\...\..........\.......\....\HtmlCleanerProcessor.java
...\...\..........\.......\....\IXHtmlProcessor.java
...\...\..........\.......\processors
...\...\..........\.......\..........\BaseProcessor.java
...\...\..........\.......\..........\CallParamProcessor.java
...\...\..........\.......\..........\CallProcessor.java
...\...\..........\.......\..........\CaseProcessor.java
...\...\..........\.......\..........\ConstantProcessor.java
...\...\..........\.......\..........\EmptyProcessor.java
...\...\..........\.......\..........\FileProcessor.java
...\...\..........\.......\..........\FunctionProcessor.java
...\...\..........\.......\..........\HtmlToXmlProcessor.java
...\..