文件名称:crawler

介绍说明--下载内容均来自于网络,请自行研究使用

爬虫分布式版本实现,基于Map-Reduce进行了实现,非常有用-Reptile distributed version achieved, based on Map-Reduce was realized very useful
(系统自动生成,下载前可以参看下载内容)

下载文件列表





crawler\build.xml

.......\LISENCE

.......\README.txt

.......\seeds-csb.txt

.......\seeds-hadoop.txt

.......\seeds-hadoopcn.txt

.......\seeds-hi.txt

.......\seeds-localhost.txt

.......\seeds-nyt.txt

.......\seeds-scst.txt

.......\seeds-wiki.txt

.......\bin\crawler.sh

.......\conf\configuration.xsl

.......\....\joycrawler-csb.xml

.......\....\joycrawler-default.xml

.......\....\joycrawler-hadoop.xml

.......\....\joycrawler-hadoopcn.xml

.......\....\joycrawler-hi.xml

.......\....\joycrawler-localhost.xml

.......\....\joycrawler-nyt.xml

.......\....\joycrawler-scst.xml

.......\....\joycrawler-wiki.xml

.......\....\log4j.properties

.......\lib\commons-cli-2.0-SNAPSHOT.jar

.......\...\commons-httpclient-3.1.jar

.......\...\commons-logging-1.0.4.jar

.......\...\db.jar

.......\...\hadoop-0.20.1-core.jar

.......\...\log4j-1.2.15.jar

.......\...\lucene-core-3.0.0.jar

.......\...\lucene-smartcn-3.0.0.jar

.......\...\lucene-snowball-3.0.0.jar

.......\...\nekohtml.jar

.......\...\xercesImpl.jar

.......\...\xercesMinimal.jar

.......\...\xml-apis.jar

.......\...\native\libdb_java48.dll

.......\src\contrib\java\org\joy\analyzer\Analyzer.java

.......\...\.......\....\...\...\........\Document.java

.......\...\.......\....\...\...\........\DocumentCreationException.java

.......\...\.......\....\...\...\........\DocumentFactory.java

.......\...\.......\....\...\...\........\Hit.java

.......\...\.......\....\...\...\........\HitAnalyzer.java

.......\...\.......\....\...\...\........\Main.java

.......\...\.......\....\...\...\........\Paragraph.java

.......\...\.......\....\...\...\........\PipelineAnalyzer.java

.......\...\.......\....\...\...\........\TokenAnalyzer.java

.......\...\.......\....\...\...\........\html\Anchor.java

.......\...\.......\....\...\...\........\....\HTMLDocument.java

.......\...\.......\....\...\...\........\....\Main.form

.......\...\.......\....\...\...\........\....\Main.java

.......\...\.......\....\...\...\........\....\ParagraphSplitter.java

.......\...\.......\....\...\...\........\....\ParseException.java

.......\...\.......\....\...\...\........\....\Parser.java

.......\...\.......\....\...\...\........\....\TagWindow.java

.......\...\.......\....\...\...\........\....\TextExtractor.java

.......\...\.......\....\...\...\........\....\Utility.java

.......\...\.......\....\...\...\........\scoring\FrequencyScorer.java

.......\...\.......\....\...\...\........\.......\PWFScorer.java

.......\...\.......\....\...\...\........\.......\Scorer.java

.......\...\.......\....\...\...\........\.......\ZeroScorer.java

.......\...\.......\....\...\...\........\terms\SimpleTermExtractor.java

.......\...\.......\....\...\...\........\.....\TermExtractor.java

.......\...\.......\....\...\...\db\DB.java

.......\...\.......\....\...\...\..\DBCursor.java

.......\...\.......\....\...\...\..\DocHit.java

.......\...\.......\....\...\...\..\DocumentDB.java

.......\...\.......\....\...\...\..\DocumentEntry.java

.......\...\.......\....\...\...\..\Entry.java

.......\...\.......\....\...\...\..\Env.java

.......\...\.......\....\...\...\..\IndexDB.java

.......\...\.......\....\...\...\..\IndexEntry.java

.......\...\.......\....\...\...\..\MergedDocHits.java

.......\...\.......\....\...\...\..\Proximity.java

.......\...\.......\....\...\...\..\QueryServer.java

.......\...\.......\....\...\...\..\ResultEntry.java

.......\...\.......\....\...\...\..\SearchEntry.java

.......\...\.......\....\...\...\..\Searcher.java

.......\...\.......\....\...\...\..\query\Query.java

.......\...\.......\....\...\...\..\.....\SocketClient.java

.......\...\.......\....\...\...\..\.....\SocketServer.java

.......\...\.......\....\...\...\nlp\ChineseTokenizer.java

.......\...\.......\....\...\...\...\LuceneTokenizer.java

.......\...\.......\....\...\...\...\Word.java

.......\...\.......\....\...\...\...\WordTokenizer.java

.......\...\java\org\apache\hadoop\mapreduce\lib\input\KeyValueLineRecordReader.java

.......\...\....\...\......\......\.........\...\.....\KeyValueTextInputFormat.java

.......\...\....\...\joy\crawler\Crawler.java

.......\...\....\...\

相关说明

  • 本站资源为会员上传分享交流与学习,如有侵犯您的权益,请联系我们删除.
  • 本站是交换下载平台,提供交流渠道,下载内容来自于网络,除下载问题外,其它问题请自行百度更多...
  • 请直接用浏览器下载本站内容,不要使用迅雷之类的下载软件,用WinRAR最新版进行解压.
  • 如果您发现内容无法下载,请稍后再次尝试;或者到消费记录里找到下载记录反馈给我们.
  • 下载后发现下载的内容跟说明不相乎,请到消费记录里找到下载记录反馈给我们,经确认后退回积分.
  • 如下载前有疑问,可以通过点击"提供者"的名字,查看对方的联系方式,联系对方咨询.

相关评论

暂无评论内容.

发表评论

*主  题:
*内  容:
*验 证 码:

源码中国 www.ymcn.org