搜索资源列表
WebCollector-master
- 基于WebCollector内核,可以自己编写爬虫的http请求、链接解析器、爬取信息更新器、抓取器等模块,WebCollector把这些基于内核编写的模块称作 插件 ,通过不同的插件组合,可以在1分钟内,把WebCollector组装成一个全新的爬虫。 WebCollector内置了一套插件(cn.edu.hfut.dmic.webcollector.plugin.redis)。基于这套插件,可以把WebCollector的
webcollector-WebCollector-master
- 网络爬虫程序,可以实现对网页的爬去,易扩展,方便使用,直接导入jar包即可-Web crawlers can be achieved on pages crawled, scalable, easy to use, you can directly import the jar package
YahooCrawler
- 通过webcollector爬虫工具抓取雅虎网站的定的一些个网址,通过这些可以练习抓取网站-web crawler in yahoo
webcollector-WebCollector-master
- 这是一款很好用的网络爬虫工具,具有很好的demo。-This is a good use of web crawler tool, with a good demo.
WebCollector-master
- 爬虫 支持表单爬取,增加分布式支持。hadoop- Crawler Support form to climb, increase distributed support. Hadoop
weibo3.2
- WebCollector是一个无须配置、便于二次开发的JAVA爬虫框架(内核),它提供精简的的API,只需少量代码即可实现一个功能强大的爬虫。WebCollector-Hadoop是WebCollector的Hadoop版本,支持分布式爬取。(WebCollector is a JAVA crawler fr a mework (kernel) that does not need to be configured and easy t
webcollector-2.71-bin
- 网络爬虫代码,关于凤凰网和河工大的网页爬取。(Web crawler code, page crawling on phoenix net and river industry.)
WebCollector
- WebCollector爬虫框架源码,对于学习爬虫有很大的帮助(WebCollector crawler fr a mework source code)
WebCollector
- java爬虫框架,在eclipse编程环境中,可以良好运行(Java reptilian fr a me)
webcollector-2.32-bin
- WebCollector是一个无须配置、便于二次开发的JAVA爬虫框架(内核),它提供精简的的API,只需少量代码即可实现一个功能强大的爬虫。(WebCollector is a JAVA crawler fr a mework (kernel) that does not need to be configured and is easy to develop for two times. It provides a streamli