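File name: Text-Classification_libSVM
- Category:
- Artificial Intelligence / Neural Networks / Genetic Algorithms
- Resource attributes:
- [Windows] [Program]
- Upload date:
- 2013-04-01
- File size:
- 4.04 MB
- Downloads:
- 0
- Uploaded by:
- 李**
- Related links:
- None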
Description (all downloadable content comes from the Internet; please study and use it on your own)
Word segmentation with seg
Parameter 1: the path of the folder that contains the input text corpus. For example, if the corpus text files are all in the train//text folder, the parameter is train//text//* . Note: each article must be in its own txt file.
Parameter 2: the path of the folder where the segmentation result files are stored, for example result//text. Note: do not append * to this path.
This tool uses ICTCLAS, the Chinese word segmentation tool from the Chinese Academy of Sciences; download it yourself from the official ICTCLAS website. Then place the Data folder, Configure.xml, ICTCLAS30.h, ICTCLAS30.lib and ICTCLAS30.dll in the same folder as seg.exe.
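The setup described above can be checked with a small script before running the tool. The following is only a sketch in Python: the helper names `missing_ictclas_items` and `build_seg_command` are hypothetical, and the assumption that seg.exe takes the two parameters as positional command-line arguments in this order is inferred from the description, not confirmed by it.

```python
import os

# Items the description says must sit in the same folder as seg.exe.
REQUIRED_ICTCLAS_ITEMS = [
    "Data",           # folder holding the ICTCLAS dictionary files
    "Configure.xml",
    "ICTCLAS30.h",
    "ICTCLAS30.lib",
    "ICTCLAS30.dll",
]


def missing_ictclas_items(tool_dir):
    """Return the required ICTCLAS items that are absent from tool_dir."""
    return [
        name
        for name in REQUIRED_ICTCLAS_ITEMS
        if not os.path.exists(os.path.join(tool_dir, name))
    ]


def build_seg_command(input_dir, output_dir, exe="seg.exe"):
    """Build an argument list for seg.exe.

    Assumption: the two parameters are passed positionally, input first.
    The input path gets the trailing //* wildcard; the output path does not.
    """
    input_pattern = input_dir.rstrip("/\\") + "//*"
    output_path = output_dir.rstrip("/\\")
    return [exe, input_pattern, output_path]
```

For the paths used in the description, `build_seg_command("train//text", "result//text")` yields the arguments `seg.exe`, `train//text//*` and `result//text`, which could then be handed to `subprocess.run` on a Windows machine where the tool is installed.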
2.getFea
(Generated automatically by the system; the contents can be reviewed before downloading)
Download file list
Configure.xml
Data
....\BiWord.big
....\charset.type
....\CoreDict.pdat
....\CoreDict.pos
....\CoreDict.unig
....\FieldDict.pdat
....\FieldDict.pos
....\GranDict.pdat
....\GranDict.pos
....\ICTCLAS30.ctx
....\ICTCLAS_First.map
....\ICTPOS.map
....\nr.ctx
....\nr.fsa
....\nr.role
....\PKU.map
....\PKU_First.map
dict.txt
feature
featureselection.exe
.......\3.txt
.......\4.txt
.......\5.txt
.......\6.txt
getFeature.exe
getRandFile.exe
getSVMfeture(df).exe
getSVMTtrain.exe
ICTCLAS30.dll
ICTCLAS30.log
mergeFile.bat
readme.txt
readme文本分类的主要流程.txt
seg
seg.exe
...\3.txt
...\4.txt
...\5.txt
...\6.txt
...\7.txt
...\8.txt
...\9.txt
...\test1.txt
...\test2.txt
svmfeature
..........\3.txt
..........\4.txt
..........\5.txt
..........\6.txt
..........\7.txt
..........\8.txt
svmtrain
........\svm.scale
........\train.scale
train
.....\3.txt
.....\4.txt
.....\5.txt
.....\6.txt
.....\7.txt
.....\8.txt
.....\9.txt
.....\test1.txt
.....\test2.txt