文件名称:文本聚类
- 所属分类:
- 中文信息处理
- 资源属性:
- [Java] [源码]
- 上传时间:
- 2013-05-14
- 文件大小:
- 4.74mb
- 下载次数:
- 0次
- 提 供 者:
- b8flowerfire@qq.com
- 相关连接:
- 无
- 下载说明:
- 别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容均来自于网络,请自行研究使用
对中文文本进行聚类分析,采用了中科院中文分词系统,使用TF-IDF进行文本词语权重的计算,采用向量模型计算文本的相似度,对中文文本进行分类。
(系统自动生成,下载前可以参看下载内容)
下载文件列表
压缩包 : TestClustering.rar 列表 TestClustering/.classpath TestClustering/.project TestClustering/.settings/org.eclipse.jdt.core.prefs TestClustering/bin/kevin/zhang/NLPIR.class TestClustering/bin/ruc/b8flowerfire/testcluster/TextCluster.class TestClustering/Data/BIG2GBK.map TestClustering/Data/BIG5.pdat TestClustering/Data/BIG5.wordlist TestClustering/Data/BiWord.big TestClustering/Data/charset.type TestClustering/Data/Configure.xml TestClustering/Data/CoreDict.pdat TestClustering/Data/CoreDict.pos TestClustering/Data/CoreDict.unig TestClustering/Data/FieldDict.pdat TestClustering/Data/FieldDict.pos TestClustering/Data/GBK.pdat TestClustering/Data/GBK.wordlist TestClustering/Data/GBK2BIG.map TestClustering/Data/GBK2GBKC.map TestClustering/Data/GBK2UTF.map TestClustering/Data/GBKA.pdat TestClustering/Data/GBKA.wordlist TestClustering/Data/GBKA2UTF.map TestClustering/Data/GBKC.pdat TestClustering/Data/GBKC.wordlist TestClustering/Data/GBKC2GBK.map TestClustering/Data/GranDict.pdat TestClustering/Data/GranDict.pos TestClustering/Data/ICTPOS.map TestClustering/Data/NewWord.lst TestClustering/Data/NLPIR.ctx TestClustering/Data/NLPIR.user TestClustering/Data/NLPIR_First.map TestClustering/Data/nr.ctx TestClustering/Data/nr.fsa TestClustering/Data/nr.role TestClustering/Data/PKU.map TestClustering/Data/PKU_First.map TestClustering/Data/UserDict.pdat TestClustering/Data/UTF2GBK.map TestClustering/Data/UTF2GBKA.map TestClustering/Data/UTF8.pdat TestClustering/Data/UTF8.wordlist TestClustering/NLPIR.dll TestClustering/NLPIR_JNI.dll TestClustering/output.txt TestClustering/src/kevin/zhang/NLPIR.java TestClustering/src/ruc/b8flowerfire/testcluster/TextCluster.java TestClustering/text/