文件名称:url
介绍说明--下载内容均来自于网络,请自行研究使用
用JavaURL编程爬取并分析网页敏感词
1.编写界面,输入一个网址,能够爬取该网址上所有的HTML源代码。
2.对网址中的文本进行提取。
3.建立敏感词库,用文本文件保存。
4.将该网址所对应的文本中的敏感词提取并高亮显示。
5.编写文本文件,可以存入多个网址;程序可爬取这些网址中的文本内容,将敏感词记录存入另一个文件,格式自定。
6.编写一个主界面,整合上述功能。(Crawling and analyzing web sensitive words with Java URL programming
1. Write the interface, input a web address, and be able to crawl all HTML source code on the web address.
2. Extract the text in the website.
3. establish sensitive thesaurus and save it in text file.
4. Extract and highlight the sensitive words in the text corresponding to the website.
5. Write a text file, which can be stored in multiple web sites; the program can crawl the text content in these web sites, and store the sensitive word record in another file, with a format of its own.
6. Write a main interface to integrate the above functions.)
1.编写界面,输入一个网址,能够爬取该网址上所有的HTML源代码。
2.对网址中的文本进行提取。
3.建立敏感词库,用文本文件保存。
4.将该网址所对应的文本中的敏感词提取并高亮显示。
5.编写文本文件,可以存入多个网址;程序可爬取这些网址中的文本内容,将敏感词记录存入另一个文件,格式自定。
6.编写一个主界面,整合上述功能。(Crawling and analyzing web sensitive words with Java URL programming
1. Write the interface, input a web address, and be able to crawl all HTML source code on the web address.
2. Extract the text in the website.
3. establish sensitive thesaurus and save it in text file.
4. Extract and highlight the sensitive words in the text corresponding to the website.
5. Write a text file, which can be stored in multiple web sites; the program can crawl the text content in these web sites, and store the sensitive word record in another file, with a format of its own.
6. Write a main interface to integrate the above functions.)
相关搜索: Java;爬虫;敏感词
(系统自动生成,下载前可以参看下载内容)
下载文件列表
文件名 | 大小 | 更新时间 |
---|---|---|
url.java | 8353 | 2019-11-07 |