搜索资源列表
jtidy-04aug2000r7-dev
- JTidy的Jar包,用于清洗Html网页并可以将其转换为相应的Xml或是Xhtml文件。
jtidy-r938-sources
- 基于java的网页信息抽取小程序,可以抽取网页信息-Web information extraction based on java applets, can be extracted web page information
JTidy-lizi
- 用JTidy将html文件转换成xml文件例子,网上例程,实现了一下,1.html为样式文件,1.xslt为抽取规则(不同网页自己修改),1.xml为结果-Html file with JTidy to convert xml file example, online routine to achieve the look, 1.html as a style file, 1.xslt for the extraction rules