搜索资源列表
chentian.nutch
- 实现了基于词库的nutch中文分词,主要修改了其中的.jj文件等-realized based on the thesaurus nutch Chinese word, the main change of them. Jj documents
chentian.fenci
- 实现了基于词库的nutch中文分词,这一部分是其中的dll文件-realized based on the thesaurus nutch Chinese word, this part is one of the dll file
hadoop-0.7.1.tar
- hadoop:Nutch集群平台,分布式编程模式,让Nutch可以自动在普通机器组成的集群中以并行方式分布执行-hadoop : Cluster Nutch software platform, distributed programming model, Let Nutch software can be automatically composed of general machinery cluster parallel to t
nutch-0.8
- nutch-0.8刚出来不久的一个很好用的搜索引擎工具 nutch-0.8刚出来不久的一个很好用的搜索引-nutch-0.8 has just come out near a very good tool to use search engine nutch-0.8 has just come out soon with a good primer of english
nutchkk
- nutch搜索的改进型工具和优化爬虫的相关工具-nutch improved search tools and optimization of the related tools reptiles
nutch_recrawl_mergecrawl
- nutch一款开源搜索引擎,recrawl是实现索引更新的脚本 mergecrawl是合并多个网站查询的bash脚本。-nutch a open source search engine, recrawl the realization of the scr ipt to update the index is to consolidate multiple sites mergecrawl query bash scr ipt.
Lucene+Nutch
- 该书首先描述了开发平台的配置, 接着详细介绍LUCENE和NUTCH开发。-The book first describes the development platform configuration, and then details the development of Lucene and NUTCH.
Lucene+Nutch
- Lucene+nuctch一书的全部源码 测试源码 和几个简单的项目-Lucene+ Nuctch a book all the source code and test a few simple items
luceneAndnutch
- Lucene+nutch构建搜索引擎原书光般内容-the source code of use Lucene+ nutch to build a search engine
Lucenechapter11
- nutch的小应用 ,看看应该对学习检索系统原理很有帮助-nutch small applications, take a look at should be very helpful to study the principle of retrieval system
nutchjar
- 搜索引擎nutch源码在eclipse中运行时所缺的俩个包,引进即可使用。-Nutch search engine in the eclipse source code at run-time is a lack of both a package, you can use to introduce.
code
- 《lucene+nutch搜索引擎开发》源代码-" Lucene+ nutch search engine development," the source code
nutch
- 开源搜索引擎nutch的一些文档资料 包括安装 以及其中各个文件的解释-Open source search engine nutch some documentation, including installation, as well as the interpretation of various documents
data1
- 《开发自己的搜索引擎——lucene+nutch》(第2版)搜索引擎数据镜像数据1-lucene+nutch,data mirror
Nutch-Web
- 在对目前具有代表性的开源网络抓取软件Nutch、Heritrix、WCT、Web-Harvest进行比较分析的基础上,提出基于Nutch的Web网站定向采集系统,并对种子站点的选取、抓取过程管理、网页去噪、新种子站点的发现等关 键问题进行重点探讨。 -The paperanalyzes typicalopen sourceWeb crawl software, such asNutch, Heritrix, WCT, andWe
nutch
- nutch视频 简单搭建环境 搜索引擎 视频讲解 容易-own yourself search engine
apache-nutch-1.3-src.tar
- apache-nutch-1.3 的源码包,需要的可以看下-apache-nutch-1.3 source package, need to look
apache-nutch-1.2-src
- nutch-1.2用于开发自己的搜索引擎-apache nutch 1.2
apache-nutch-1.6-bin.tar
- nutch是阿帕奇的顶级业务,是专业的搜索引擎开发工具-nutch is the top wrok of Apache
apache-nutch-1.13-src
- 网络编程一个非常不错的开源网络爬虫学习代码!(windows network open source)