搜索资源列表
Music-Search
- 面向音乐搜索的垂直搜索软件是采用Java语言开发的简易的互联网音乐搜索引擎,它是集网络爬虫(采用java内置的多线程及Socket技术)和基于B/S结构的Web查询(Struts框架)为一体的搜索软件。可用于本科毕业设计。-Music Search Software
Craw
- 一个简单的Java爬虫框架,需要对自己要爬的网站写分析规则,可以自动设定下载线程数量,限制最大网速-A simple robot to catch content from site.
YukiSpider
- 基于HttpClient4.0的网络爬虫基本框架(Java实现)-Analog HTTP request: HttpClient 4.0 Target page structure analysis, HTTP request header information analysis: Firefox+ firebug/Chrome (F12 developer mode) HTML parsing: Jsoup
webmagic-master
- 一个爬虫框架,除了不会反爬虫外(当然可以自己加)其他都很牛逼,用java写的。-A crawler fr a me, besides will not reverse the crawler themselves are added (of course) other are very cow force, written in Java.
webmagic
- 开源的Java垂直爬虫框架,目标是简化爬虫的开发流程,让开发者专注于逻辑功能的开发。webmagic的核心非常简单,但是覆盖爬虫的整个流程,也是很好的学习爬虫开发的材料。作者曾经在前公司进行过一年的垂直爬虫的开发,webmagic就是为了解决爬虫开发的一些重复劳动而产生的框架。-Open source Java vertical crawler fr a mework, the goal is to simplify the devel
crawler
- 轻量级爬虫框架,可控制抓取深度 跟踪最初站源 可配置线程池 可配置UserAgent 可决定是否要抽取链接 Bloom Filter 可控制爬取速度 内置UserAgent池 支持Proxy池(Lightweight crawler fr a mework)
crawler4j-3.5-src
- 一款不错的用于java语言的爬虫框架,编程简单方便,编程人员不需具备较好的功底也能轻松使用(A good for Java language crawler fr a mework, programming simple and convenient, programmers need not have a good foundation, but also easy to use)
DownloadProxy
- webmagic框架实现网络爬虫,用java语言实现为爬虫添加代理(Using java language to add agents for reptiles)
weibo3.2
- WebCollector是一个无须配置、便于二次开发的JAVA爬虫框架(内核),它提供精简的的API,只需少量代码即可实现一个功能强大的爬虫。WebCollector-Hadoop是WebCollector的Hadoop版本,支持分布式爬取。(WebCollector is a JAVA crawler fr a mework (kernel) that does not need to be configured and easy t
java网络爬虫
- 是一个无须配置、便于二次开发的JAVA爬虫框架(内核),它提供精简的的API,只需少量代码即可实现一个功能强大的爬虫(Is a JAVA reptile fr a mework (kernel) that does not need to be configured for easy development. It provides a streamlined API that requires a small amount of co
WebCollector
- WebCollector爬虫框架源码,对于学习爬虫有很大的帮助(WebCollector crawler fr a mework source code)
WebCollector
- java爬虫框架,在eclipse编程环境中,可以良好运行(Java reptilian fr a me)
webcollector-2.32-bin
- WebCollector是一个无须配置、便于二次开发的JAVA爬虫框架(内核),它提供精简的的API,只需少量代码即可实现一个功能强大的爬虫。(WebCollector is a JAVA crawler fr a mework (kernel) that does not need to be configured and is easy to develop for two times. It provides a streamli