文件名称:pz
- 所属分类:
- JSP源码/Java
- 资源属性:
- [Linux] [C/C++] [源码]
- 上传时间:
- 2012-11-26
- 文件大小:
- 5.5mb
- 下载次数:
- 0次
- 提 供 者:
- x**
- 相关连接:
- 无
- 下载说明:
- 别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容均来自于网络,请自行研究使用
垂直搜索的网络爬虫,收集新闻信息的爬虫,采用java编写,附带源代码.-Vertical search network reptiles, reptiles to collect news and information, using java to prepare, with the source code
(系统自动生成,下载前可以参看下载内容)
下载文件列表
WebNewsCrawler-1.0
..................\bin
..................\...\conf
..................\...\....\commons-logging.properties
..................\...\....\crawler.properties
..................\...\....\file.types
..................\...\....\html2xml.properties
..................\...\....\jtidy.properties
..................\...\....\keycontent.properties
..................\...\....\log4j.properties
..................\...\....\mime.types
..................\...\....\preffered.encodings
..................\...\....\scripts
..................\...\....\.......\google.blognewschannel.script
..................\...\....\.......\researchbuzz.script
..................\...\....\.......\search-marketing.info.script
..................\...\....\.......\sesq
..................\...\....\.......\slashdot.org.script
..................\...\....\serverlist.txt
..................\...\crawler-console.sh
..................\...\crawler-export.sh
..................\...\crawler-server.sh
..................\...\news-rss.crl
..................\...\res.jar
..................\...\senews.crl
..................\...\TanaSend.jar
..................\...\webnews-crawler.jar
..................\doc
..................\...\javadoc.tar.bz2
..................\LICENSE.txt
..................\licenses
..................\........\Apache1_1.LICENSE.txt
..................\........\Apache2_0.LICENSE.txt
..................\........\calpa.LICENSE.txt
..................\........\cpdetector.LICENSE.txt
..................\........\CyberNeko.LICENSE.txt
..................\........\Informa.LICENSE.txt
..................\........\JDOM.LICENSE.txt
..................\........\je.LICENSE.txt
..................\........\JGoodies.LICENSE.txt
..................\........\JTidy.LICENSE.txt
..................\........\LICENSE.txt
..................\README.txt
..................\src.tar.bz2
..................\bin
..................\...\conf
..................\...\....\commons-logging.properties
..................\...\....\crawler.properties
..................\...\....\file.types
..................\...\....\html2xml.properties
..................\...\....\jtidy.properties
..................\...\....\keycontent.properties
..................\...\....\log4j.properties
..................\...\....\mime.types
..................\...\....\preffered.encodings
..................\...\....\scripts
..................\...\....\.......\google.blognewschannel.script
..................\...\....\.......\researchbuzz.script
..................\...\....\.......\search-marketing.info.script
..................\...\....\.......\sesq
..................\...\....\.......\slashdot.org.script
..................\...\....\serverlist.txt
..................\...\crawler-console.sh
..................\...\crawler-export.sh
..................\...\crawler-server.sh
..................\...\news-rss.crl
..................\...\res.jar
..................\...\senews.crl
..................\...\TanaSend.jar
..................\...\webnews-crawler.jar
..................\doc
..................\...\javadoc.tar.bz2
..................\LICENSE.txt
..................\licenses
..................\........\Apache1_1.LICENSE.txt
..................\........\Apache2_0.LICENSE.txt
..................\........\calpa.LICENSE.txt
..................\........\cpdetector.LICENSE.txt
..................\........\CyberNeko.LICENSE.txt
..................\........\Informa.LICENSE.txt
..................\........\JDOM.LICENSE.txt
..................\........\je.LICENSE.txt
..................\........\JGoodies.LICENSE.txt
..................\........\JTidy.LICENSE.txt
..................\........\LICENSE.txt
..................\README.txt
..................\src.tar.bz2