File name: webharvest1-project
- Category:
- xml/soap/webservice
- Resource type:
- [Java] [Source code]
- Upload date:
- 2008-10-13
- File size:
- 5.48 MB
- Downloads:
- 0
- Uploader:
- 陈*
- Related links:
- None
Description (all downloadable content is collected from the Internet; evaluate and use it at your own discretion)
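A very handy tool for extracting information from web pages. It builds on existing technologies such as XSLT and XQuery to extract data from XML/HTML-based web pages.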
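To make the XSLT-based extraction idea concrete, here is a minimal sketch that pulls link targets and link texts out of a small, well-formed page. It uses only the JDK's built-in `javax.xml.transform` API, not the Web-Harvest classes in this archive; the class name `XsltExtractSketch`, the sample page, and the stylesheet are all assumptions made for illustration.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

public class XsltExtractSketch {

    // Hypothetical, already well-formed page. Real-world HTML usually has to be
    // cleaned into XML first (the archive ships htmlcleaner.jar for that purpose).
    static final String PAGE =
        "<html><body>"
      + "<a href=\"/a\">First</a>"
      + "<a href=\"/b\">Second</a>"
      + "</body></html>";

    // Hypothetical XSLT 1.0 stylesheet: emit "href=text;" for every <a> element.
    static final String STYLESHEET =
        "<xsl:stylesheet version=\"1.0\""
      + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
      + "<xsl:output method=\"text\"/>"
      + "<xsl:template match=\"/\">"
      + "<xsl:for-each select=\"//a\">"
      + "<xsl:value-of select=\"@href\"/>"
      + "<xsl:text>=</xsl:text>"
      + "<xsl:value-of select=\".\"/>"
      + "<xsl:text>;</xsl:text>"
      + "</xsl:for-each>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Apply the stylesheet to the XML text and return the transformation result.
    static String extract(String xml, String xslt) {
        try {
            Transformer transformer = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(xslt)));
            StringWriter out = new StringWriter();
            transformer.transform(
                new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            // Wrap the checked exception so callers need no throws clause.
            throw new IllegalStateException("XSLT extraction failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(extract(PAGE, STYLESHEET)); // prints "/a=First;/b=Second;"
    }
}
```

The stylesheet selects every `a` element with the XPath expression `//a` and writes its `href` attribute and text content as plain text, which is the same select-and-emit pattern that an extraction pipeline built on XSLT or XQuery relies on.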
Download file list
Archive: 29782208webharvest1-project.rar
Contents:
webharvest1-project\build.xml webharvest1-project\config\config.xsd webharvest1-project\config\log4j.properties webharvest1-project\config\MANIFEST.MF webharvest1-project\config webharvest1-project\examples\canon.xml webharvest1-project\examples\crawler.xml webharvest1-project\examples\expekt.xml webharvest1-project\examples\flickr.xml webharvest1-project\examples\functions.xml webharvest1-project\examples\google_images.xml webharvest1-project\examples\nytimes.xml webharvest1-project\examples\xquery.xml webharvest1-project\examples\yahoomail.xml webharvest1-project\examples webharvest1-project\lib\bsh.jar webharvest1-project\lib\commons-cli-1.1.jar webharvest1-project\lib\commons-codec-1.3.jar webharvest1-project\lib\commons-collections-3.1.jar webharvest1-project\lib\commons-httpclient-3.1.jar webharvest1-project\lib\commons-logging.jar webharvest1-project\lib\groovy-all-1.0.jar webharvest1-project\lib\htmlcleaner.jar webharvest1-project\lib\js.jar webharvest1-project\lib\log4j-1.2.13.jar webharvest1-project\lib\saxon8-dom.jar webharvest1-project\lib\saxon8.jar webharvest1-project\lib webharvest1-project\licences\apache_licence.txt webharvest1-project\licences\asm_licence.txt webharvest1-project\licences\beanshell_licence.txt webharvest1-project\licences\bounce_licence.txt webharvest1-project\licences\groovy_licence.txt webharvest1-project\licences\htmlcleaner_licence.txt webharvest1-project\licences\rhino_licence.txt webharvest1-project\licences\saxon_licence.txt webharvest1-project\licences\webharvest_licence.txt webharvest1-project\licences webharvest1-project\src\CommandLine.java webharvest1-project\src\org\apache\commons\httpclient\contrib\ssl\AuthSSLInitializationError.java webharvest1-project\src\org\apache\commons\httpclient\contrib\ssl\AuthSSLProtocolSocketFactory.java webharvest1-project\src\org\apache\commons\httpclient\contrib\ssl\AuthSSLX509TrustManager.java 
webharvest1-project\src\org\apache\commons\httpclient\contrib\ssl\EasySSLProtocolSocketFactory.java webharvest1-project\src\org\apache\commons\httpclient\contrib\ssl\EasyX509TrustManager.java webharvest1-project\src\org\apache\commons\httpclient\contrib\ssl\StrictSSLProtocolSocketFactory.java webharvest1-project\src\org\apache\commons\httpclient\contrib\ssl webharvest1-project\src\org\apache\commons\httpclient\contrib webharvest1-project\src\org\apache\commons\httpclient webharvest1-project\src\org\apache\commons webharvest1-project\src\org\apache webharvest1-project\src\org\bounce\text\ScrollableEditorPanel.java webharvest1-project\src\org\bounce\text\xml\WrappedXMLView.java webharvest1-project\src\org\bounce\text\xml\XMLContext.java webharvest1-project\src\org\bounce\text\xml\XMLDocument.java webharvest1-project\src\org\bounce\text\xml\XMLEditorKit.java webharvest1-project\src\org\bounce\text\xml\XMLInputReader.java webharvest1-project\src\org\bounce\text\xml\XMLInputStream.java webharvest1-project\src\org\bounce\text\xml\XmlParserUtils.java webharvest1-project\src\org\bounce\text\xml\XMLScanner.java webharvest1-project\src\org\bounce\text\xml\XMLStyleConstants.java webharvest1-project\src\org\bounce\text\xml\XMLView.java webharvest1-project\src\org\bounce\text\xml\XMLViewUtilities.java webharvest1-project\src\org\bounce\text\xml webharvest1-project\src\org\bounce\text webharvest1-project\src\org\bounce webharvest1-project\src\org\webharvest\definition\BaseElementDef.java webharvest1-project\src\org\webharvest\definition\CallDef.java webharvest1-project\src\org\webharvest\definition\CallParamDef.java webharvest1-project\src\org\webharvest\definition\CaseDef.java webharvest1-project\src\org\webharvest\definition\ConstantDef.java webharvest1-project\src\org\webharvest\definition\DefinitionResolver.java webharvest1-project\src\org\webharvest\definition\ElementInfo.java webharvest1-project\src\org\webharvest\definition\EmptyDef.java 
webharvest1-project\src\org\webharvest\definition\ExitDef.java webharvest1-project\src\org\webharvest\definition\FileDef.java webharvest1-project\src\org\webharvest\definition\FunctionDef.java webharvest1-project\src\org\webharvest\definition\HtmlToXmlDef.java webharvest1-project\src\org\webharvest\definition\HttpDef.java webharvest1-project\src\org\webharvest\definition\HttpHeaderDef.java webharvest1-project\src\org\webharvest\definition\HttpParamDef.java webharvest1-project\src\org\webharvest\definition\IElementDef.java webharvest1-project\src\org\webharvest\definition\IfDef.java webharvest1-project\src\org\webharvest\definition\IncludeDef.java webharvest1-project\src\org\webharvest\definition\LoopDef.java webharvest1-project\src\org\webharvest\definition\RegexpDef.java webharvest1-project\src\org\webharvest\definition\ReturnDef.java webharvest1-project\src\org\webharvest\definition\ScraperConfiguration.java webharvest1-project\src\org\webharvest\definition\ScriptDef.java webharvest1-project\src\org\webharvest\definition\TemplateDef.java webharvest1-project\src\org\webharvest\definition\TextDef.java webharvest1-project\src\org\webharvest\definition\TryDef.java webharvest1-project\src\org\webharvest\definition\VarDef.java webharvest1-project\src\org\webharvest\definition\VarDefDef.java webharvest1-project\src\org\webharvest\definition\WhileDef.java webharvest1-project\src\org\webharvest\definition\XmlNode.java webharvest1-project\src\org\webharvest\definition\XmlParser.java webharvest1-project\src\org\webharvest\definition\XPathDef.java webharvest1-project\src\org\webharvest\definition\XQueryDef.java webharvest1-project\src\org\webharvest\definition\XQueryExternalParamDef.java webharvest1-project\src\org\webharvest\definition\XsltDef.java webharvest1-project\src\org\webharvest\definition webharvest1-project\src\org\webharvest\exception\BaseException.java webharvest1-project\src\org\webharvest\exception\ConfigurationException.java 
webharvest1-project\src\org\webharvest\exception\ErrMsg.java webharvest1-project\src\org\webharvest\exception\FileException.java webharvest1-project\src\org\webharvest\exception\FunctionException.java webharvest1-project\src\org\webharvest\exception\HttpException.java webharvest1-project\src\org\webharvest\exception\ParserException.java webharvest1-project\src\org\webharvest\exception\ScraperXPathException.java webharvest1-project\src\org\webharvest\exception\ScraperXQueryException.java webharvest1-project\src\org\webharvest\exception\ScriptException.java webharvest1-project\src\org\webharvest\exception\TemplateException.java webharvest1-project\src\org\webharvest\exception\TemplaterException.java webharvest1-project\src\org\webharvest\exception\VariableException.java webharvest1-project\src\org\webharvest\exception\XsltException.java webharvest1-project\src\org\webharvest\exception webharvest1-project\src\org\webharvest\gui\AboutWindow.java webharvest1-project\src\org\webharvest\gui\AutoCompleter.java webharvest1-project\src\org\webharvest\gui\component\DropDownButton.java webharvest1-project\src\org\webharvest\gui\component\DropDownButtonListener.java webharvest1-project\src\org\webharvest\gui\component\GCPanel.java webharvest1-project\src\org\webharvest\gui\component\ProportionalSplitPane.java webharvest1-project\src\org\webharvest\gui\component webharvest1-project\src\org\webharvest\gui\ConfigDocument.java webharvest1-project\src\org\webharvest\gui\ConfigPanel.java webharvest1-project\src\org\webharvest\gui\DialogHelper.java webharvest1-project\src\org\webharvest\gui\FindReplaceDialog.java webharvest1-project\src\org\webharvest\gui\HelpFrame.java webharvest1-project\src\org\webharvest\gui\Ide.java webharvest1-project\src\org\webharvest\gui\NodeRenderer.java webharvest1-project\src\org\webharvest\gui\PropertiesGrid.java webharvest1-project\src\org\webharvest\gui\PropertiesGridModel.java webharvest1-project\src\org\webharvest\gui\ResourceManager.java 
webharvest1-project\src\org\webharvest\gui\resources\about.html webharvest1-project\src\org\webharvest\gui\resources\headerbg.jpg webharvest1-project\src\org\webharvest\gui\resources\help\basics.html webharvest1-project\src\org\webharvest\gui\resources\help\call.html webharvest1-project\src\org\webharvest\gui\resources\help\case.html webharvest1-project\src\org\webharvest\gui\resources\help\config.html webharvest1-project\src\org\webharvest\gui\resources\help\diagram1.gif webharvest1-project\src\org\webharvest\gui\resources\help\empty.html webharvest1-project\src\org\webharvest\gui\resources\help\exit.html webharvest1-project\src\org\webharvest\gui\resources\help\file.html webharvest1-project\src\org\webharvest\gui\resources\help\function.html webharvest1-project\src\org\webharvest\gui\resources\help\htmltoxml.html webharvest1-project\src\org\webharvest\gui\resources\help\http.html webharvest1-project\src\org\webharvest\gui\resources\help\httpheader.html webharvest1-project\src\org\webharvest\gui\resources\help\httpparam.html webharvest1-project\src\org\webharvest\gui\resources\help\httpproc.html webharvest1-project\src\org\webharvest\gui\resources\help\include.html webharvest1-project\src\org\webharvest\gui\resources\help\licence.html webharvest1-project\src\org\webharvest\gui\resources\help\loop.html webharvest1-project\src\org\webharvest\gui\resources\help\overview.html webharvest1-project\src\org\webharvest\gui\resources\help\regexp.html webharvest1-project\src\org\webharvest\gui\resources\help\release.html webharvest1-project\src\org\webharvest\gui\resources\help\return.html webharvest1-project\src\org\webharvest\gui\resources\help\script.html webharvest1-project\src\org\webharvest\gui\resources\help\sys.html webharvest1-project\src\org\webharvest\gui\resources\help\template.html webharvest1-project\src\org\webharvest\gui\resources\help\text.html webharvest1-project\src\org\webharvest\gui\resources\help\try.html 
webharvest1-project\src\org\webharvest\gui\resources\help\var.html webharvest1-project\src\org\webharvest\gui\resources\help\vardef.html webharvest1-project\src\org\webharvest\gui\resources\help\while.html webharvest1-project\src\org\webharvest\gui\resources\help\xpath.html webharvest1-project\src\org\webharvest\gui\resources\help\xquery.html webharvest1-project\src\org\webharvest\gui\resources\help\xslt.html webharvest1-project\src\org\webharvest\gui\resources\help webharvest1-project\src\org\webharvest\gui\resources\help.xml webharvest1-project\src\org\webharvest\gui\resources\icons\call.gif webharvest1-project\src\org\webharvest\gui\resources\icons\case.gif webharvest1-project\src\org\webharvest\gui\resources\icons\close.gif webharvest1-project\src\org\webharvest\gui\resources\icons\const.gif webharvest1-project\src\org\webharvest\gui\resources\icons\copy.gif webharvest1-project\src\org\webharvest\gui\resources\icons\cut.gif webharvest1-project\src\org\webharvest\gui\resources\icons\default.gif webharvest1-project\src\org\webharvest\gui\resources\icons\download.gif webharvest1-project\src\org\webharvest\gui\resources\icons\empty.gif webharvest1-project\src\org\webharvest\gui\resources\icons\file.gif webharvest1-project\src\org\webharvest\gui\resources\icons\find.gif webharvest1-project\src\org\webharvest\gui\resources\icons\function.gif webharvest1-project\src\org\webharvest\gui\resources\icons\help.gif webharvest1-project\src\org\webharvest\gui\resources\icons\help32.gif webharvest1-project\src\org\webharvest\gui\resources\icons\helpdir.gif webharvest1-project\src\org\webharvest\gui\resources\icons\helptopic.gif webharvest1-project\src\org\webharvest\gui\resources\icons\homepage.gif webharvest1-project\src\org\webharvest\gui\resources\icons\htmltoxml.gif webharvest1-project\src\org\webharvest\gui\resources\icons\html_type.gif webharvest1-project\src\org\webharvest\gui\resources\icons\http.gif 
webharvest1-project\src\org\webharvest\gui\resources\icons\httpparam.gif webharvest1-project\src\org\webharvest\gui\resources\icons\image_type.gif webharvest1-project\src\org\webharvest\gui\resources\icons\include.gif webharvest1-project\src\org\webharvest\gui\resources\icons\list_type.gif webharvest1-project\src\org\webharvest\gui\resources\icons\loop.gif webharvest1-project\src\org\webharvest\gui\resources\icons\new.gif webharvest1-project\src\org\webharvest\gui\resources\icons\none.gif webharvest1-project\src\org\webharvest\gui\resources\icons\open.gif webharvest1-project\src\org\webharvest\gui\resources\icons\paste.gif webharvest1-project\src\org\webharvest\gui\resources\icons\pause.gif webharvest1-project\src\org\webharvest\gui\resources\icons\prettyprint.gif webharvest1-project\src\org\webharvest\gui\resources\icons\process.gif webharvest1-project\src\org\webharvest\gui\resources\icons\redo.gif webharvest1-project\src\org\webharvest\gui\resources\icons\refresh.gif webharvest1-project\src\org\webharvest\gui\resources\icons\regexp.gif webharvest1-project\src\org\webharvest\gui\resources\icons\run.gif webharvest1-project\src\org\webharvest\gui\resources\icons\runparams.gif webharvest1-project\src\org\webharvest\gui\resources\icons\save.gif webharvest1-project\src\org\webharvest\gui\resources\icons\settings.gif webharvest1-project\src\org\webharvest\gui\resources\icons\small_error.gif webharvest1-project\src\org\webharvest\gui\resources\icons\small_finished.gif webharvest1-project\src\org\webharvest\gui\resources\icons\small_paused.gif webharvest1-project\src\org\webharvest\gui\resources\icons\small_run.gif webharvest1-project\src\org\webharvest\gui\resources\icons\small_view.gif webharvest1-project\src\org\webharvest\gui\resources\icons\stop.gif webharvest1-project\src\org\webharvest\gui\resources\icons\template.gif webharvest1-project\src\org\webharvest\gui\resources\icons\text.gif webharvest1-project\src\org\webharvest\gui\resources\icons\text_type.gif 
webharvest1-project\src\org\webharvest\gui\resources\icons\Thumbs.db webharvest1-project\src\org\webharvest\gui\resources\icons\trashcan.gif webharvest1-project\src\org\webharvest\gui\resources\icons\try.gif webharvest1-project\src\org\webharvest\gui\resources\icons\undo.gif webharvest1-project\src\org\webharvest\gui\resources\icons\validate.gif webharvest1-project\src\org\webharvest\gui\resources\icons\var.gif webharvest1-project\src\org\webharvest\gui\resources\icons\vardef.gif webharvest1-project\src\org\webharvest\gui\resources\icons\view.gif webharvest1-project\src\org\webharvest\gui\resources\icons\webharvest.gif webharvest1-project\src\org\webharvest\gui\resources\icons\xml_type.gif webharvest1-project\src\org\webharvest\gui\resources\icons\xpath.gif webharvest1-project\src\org\webharvest\gui\resources\icons\xquery.gif webharvest1-project\src\org\webharvest\gui\resources\icons\xslt.gif webharvest1-project\src\org\webharvest\gui\resources\icons\zoomin.gif webharvest1-project\src\org\webharvest\gui\resources\icons\zoomout.gif webharvest1-project\src\org\webharvest\gui\resources\icons webharvest1-project\src\org\webharvest\gui\resources\welcome.html webharvest1-project\src\org\webharvest\gui\resources\welcomelogo.jpg webharvest1-project\src\org\webharvest\gui\resources webharvest1-project\src\org\webharvest\gui\RunParamsDialog.java webharvest1-project\src\org\webharvest\gui\ScraperExecutionThread.java webharvest1-project\src\org\webharvest\gui\Settings.java webharvest1-project\src\org\webharvest\gui\SettingsDialog.java webharvest1-project\src\org\webharvest\gui\StatusBar.java webharvest1-project\src\org\webharvest\gui\TextAreaAppender.java webharvest1-project\src\org\webharvest\gui\TreeNodeInfo.java webharvest1-project\src\org\webharvest\gui\ViewerFrame.java webharvest1-project\src\org\webharvest\gui\WelcomePanel.java webharvest1-project\src\org\webharvest\gui\XmlEditorScrollPane.java webharvest1-project\src\org\webharvest\gui\XmlFileFilter.java 
webharvest1-project\src\org\webharvest\gui\XmlTextPane.java webharvest1-project\src\org\webharvest\gui webharvest1-project\src\org\webharvest\runtime\processors\BaseProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\BodyProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\CallParamProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\CallProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\CaseProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\ConstantProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\EmptyProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\ExitProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\FileProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\FunctionProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\HtmlToXmlProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\HttpHeaderProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\HttpParamProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\HttpProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\IncludeProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\LoopProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\ProcessorResolver.java webharvest1-project\src\org\webharvest\runtime\processors\RegexpProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\ReturnProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\ScriptProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\TemplateProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\TextProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\TryProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\VarDefProcessor.java 
webharvest1-project\src\org\webharvest\runtime\processors\VarProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\WhileProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\XPathProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\XQueryExpressionPool.java webharvest1-project\src\org\webharvest\runtime\processors\XQueryProcessor.java webharvest1-project\src\org\webharvest\runtime\processors\XsltProcessor.java webharvest1-project\src\org\webharvest\runtime\processors webharvest1-project\src\org\webharvest\runtime\RuntimeConfig.java webharvest1-project\src\org\webharvest\runtime\Scraper.java webharvest1-project\src\org\webharvest\runtime\ScraperContext.java webharvest1-project\src\org\webharvest\runtime\ScraperRuntimeListener.java webharvest1-project\src\org\webharvest\runtime\scripting\BeanShellScriptEngine.java webharvest1-project\src\org\webharvest\runtime\scripting\GroovyScriptEngine.java webharvest1-project\src\org\webharvest\runtime\scripting\JavascriptScriptEngine.java webharvest1-project\src\org\webharvest\runtime\scripting\ScriptEngine.java webharvest1-project\src\org\webharvest\runtime\scripting\SetContextVar.java webharvest1-project\src\org\webharvest\runtime\scripting webharvest1-project\src\org\webharvest\runtime\templaters\BaseTemplater.java webharvest1-project\src\org\webharvest\runtime\templaters webharvest1-project\src\org\webharvest\runtime\variables\EmptyVariable.java webharvest1-project\src\org\webharvest\runtime\variables\ListVariable.java webharvest1-project\src\org\webharvest\runtime\variables\NodeVariable.java webharvest1-project\src\org\webharvest\runtime\variables\Types.java webharvest1-project\src\org\webharvest\runtime\variables\Variable.java webharvest1-project\src\org\webharvest\runtime\variables webharvest1-project\src\org\webharvest\runtime\web\HttpClientManager.java webharvest1-project\src\org\webharvest\runtime\web\HttpInfo.java 
webharvest1-project\src\org\webharvest\runtime\web\HttpResponseWrapper.java webharvest1-project\src\org\webharvest\runtime\web\IHttpManager.java webharvest1-project\src\org\webharvest\runtime\web webharvest1-project\src\org\webharvest\runtime webharvest1-project\src\org\webharvest\utils\Catalog.java webharvest1-project\src\org\webharvest\utils\CommonUtil.java webharvest1-project\src\org\webharvest\utils\Constants.java webharvest1-project\src\org\webharvest\utils\KeyValuePair.java webharvest1-project\src\org\webharvest\utils\Stack.java webharvest1-project\src\org\webharvest\utils\SystemUtilities.java webharvest1-project\src\org\webharvest\utils\XmlNodeWrapper.java webharvest1-project\src\org\webharvest\utils\XmlUtil.java webharvest1-project\src\org\webharvest\utils\XmlValidator.java webharvest1-project\src\org\webharvest\utils webharvest1-project\src\org\webharvest webharvest1-project\src\org webharvest1-project\src\Test.java webharvest1-project\src webharvest1-project