File name: Perl
- Category: Linux/Unix programming
- Resource tags: [Linux] [Perl] [Source code]
- Upload date: 2012-11-26
- File size: 7 KB
- Downloads: 0
- Uploaded by: liti*****
Description
A Perl implementation of tokenization, feature selection, and document classification.

The project's goal is an application that produces a brief listing for a set of books in XML format, so that readers can use the listing to decide which book to pick, or to check whether the set contains books of a particular genre. The application therefore provides at least the Title, Author, Language, Release Date, and Genre fields. To provide this information, it fetches the test files and training files, processes them to find the desired content, and stores only the extracted content in the output file (books.xml). The extracted content should help readers understand what each book is about. One important fact the application must provide is the genre, because readers may want to search only a certain category of books.
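The extraction-and-output step described above might look roughly like the following Perl sketch. It is not the code in bsquared.pl. It assumes each book carries header lines of the form "Field: value", which the description does not confirm, and the helper names extract_fields, xml_escape, and book_to_xml as well as the output element names are hypothetical. Genre is left out because it comes from classification rather than from the header.

```perl
use strict;
use warnings;

# Assumption: one "Field: value" line per field (not confirmed by the source).
my @FIELDS = ('Title', 'Author', 'Language', 'Release Date');

# Hypothetical helper: collect the header fields found in one book's text.
sub extract_fields {
    my ($text) = @_;
    my %book;
    for my $field (@FIELDS) {
        # /m lets ^ and $ match at every line boundary inside the text.
        if ($text =~ /^\s*\Q$field\E:\s*(.+?)\s*$/m) {
            $book{$field} = $1;
        }
    }
    return \%book;
}

# Escape the characters that are not allowed in XML character data.
sub xml_escape {
    my ($s) = @_;
    $s =~ s/&/&amp;/g;
    $s =~ s/</&lt;/g;
    $s =~ s/>/&gt;/g;
    return $s;
}

# Hypothetical helper: render one book as a <book> element.
sub book_to_xml {
    my ($book) = @_;
    my $xml = "<book>\n";
    for my $field (@FIELDS) {
        next unless defined $book->{$field};
        (my $tag = lc $field) =~ s/\s+/_/g;   # "Release Date" -> "release_date"
        $xml .= "  <$tag>" . xml_escape($book->{$field}) . "</$tag>\n";
    }
    return $xml . "</book>\n";
}

my $sample = "Title: Moby Dick\nAuthor: Herman Melville\n"
           . "Language: English\nRelease Date: June, 2001\n";
print book_to_xml(extract_fields($sample));
```

Fields that are missing from a book are simply skipped, so the listing only contains content that was actually found.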
To implement the tasks above, the application first tokenizes the books (the test and training XML files) to represent each document for fact extraction and classification. This step should be careful to ...
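The tokenization step could be sketched in Perl along these lines. This is a minimal illustration under stated assumptions, not the package's own code: the function names tokenize and term_counts are hypothetical, and stripping tags, lowercasing, and keeping only ASCII letter runs (with an optional apostrophe part) are choices made for the sketch.

```perl
use strict;
use warnings;

# Hypothetical helper: turn the text of one book into lowercase word tokens.
sub tokenize {
    my ($text) = @_;
    $text =~ s/<[^>]*>/ /g;    # replace XML tags with a space
    $text = lc $text;          # fold case so "Whale" and "whale" match
    # A token is a run of letters, optionally followed by 'letters (e.g. "it's").
    my @tokens = $text =~ /[a-z]+(?:'[a-z]+)?/g;
    return @tokens;
}

# Hypothetical helper: count how often each token occurs in a document.
sub term_counts {
    my (@tokens) = @_;
    my %count;
    $count{$_}++ for @tokens;
    return \%count;
}

my $sample = "<book><title>Moby Dick</title> The whale, the WHALE!</book>";
my @tokens = tokenize($sample);
my $counts = term_counts(@tokens);
print join(" ", @tokens), "\n";
print "$_: $counts->{$_}\n" for sort keys %$counts;
```

The per-document term counts are one simple representation that later feature selection and classification could work from.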
File list
bsquaredold
bsquared.pl
bsquared.pl