文件名称:textclustering-master
介绍说明--下载内容均来自于网络,请自行研究使用
对于大文本进行挖掘聚类,该方法不考虑文字词语出现的频率信息,考虑上下文语境,将所有的字根据预定义的特征进行词位特征学习,获得一个训练模型。然后对待分字符串的每一个字进行词位标注,最后根据词位定义获得最终的分词结果。(Digging for large text clustering, the method does not consider the text word frequency of information, considering the context, all the words according to the characters of a predefined characteristics of word learning, to get a training model.Then each word of the character string is marked with the word bit, and finally the final word segmentation result is obtained according to the definition of the word bit.)
相关搜索: 文本聚类算法
(系统自动生成,下载前可以参看下载内容)
下载文件列表
文件名 | 大小 | 更新时间 |
---|---|---|
textclustering-master | 0 | 2014-06-29 |
textclustering-master\.gitignore | 110 | 2014-06-29 |
textclustering-master\.project | 543 | 2014-06-29 |
textclustering-master\C50.zip | 8194031 | 2014-06-29 |
textclustering-master\META-INF | 0 | 2014-06-29 |
textclustering-master\META-INF\MANIFEST.MF | 68 | 2014-06-29 |
textclustering-master\README.md | 616 | 2014-06-29 |
textclustering-master\data-no-duplicates.txt | 2169 | 2014-06-29 |
textclustering-master\data.csv | 33 | 2014-06-29 |
textclustering-master\data.txt | 2700 | 2014-06-29 |
textclustering-master\data.txt.bak | 5960 | 2014-06-29 |
textclustering-master\data | 0 | 2014-06-29 |
textclustering-master\data\data1-1.PNG | 26548 | 2014-06-29 |
textclustering-master\data\data1-1.eps | 1850925 | 2014-06-29 |
textclustering-master\data\data1.PNG | 26456 | 2014-06-29 |
textclustering-master\data\data1.eps | 370200 | 2014-06-29 |
textclustering-master\data\data1.txt | 1181 | 2014-06-29 |
textclustering-master\data\data1.xls | 14336 | 2014-06-29 |
textclustering-master\data\data2-1.PNG | 29384 | 2014-06-29 |
textclustering-master\data\data2-1.eps | 1867884 | 2014-06-29 |
textclustering-master\data\data2.eps | 373018 | 2014-06-29 |
textclustering-master\data\data2.txt | 1257 | 2014-06-29 |
textclustering-master\data\data2.xls | 17408 | 2014-06-29 |
textclustering-master\data\data3.eps | 14443 | 2014-06-29 |
textclustering-master\data\data3.txt | 16785 | 2014-06-29 |
textclustering-master\data\data3.txt.bak | 16070 | 2014-06-29 |
textclustering-master\data\dendrogram.eps | 1597385 | 2014-06-29 |
textclustering-master\data\graph1.doc | 233984 | 2014-06-29 |
textclustering-master\data\graph2.doc | 168960 | 2014-06-29 |
textclustering-master\data\groups.eps | 1843623 | 2014-06-29 |
textclustering-master\data\mopsi photo descriptions_all.xlsx | 129996 | 2014-06-29 |
textclustering-master\data\mopsi-service.txt | 3108 | 2014-06-29 |
textclustering-master\data\mopsi-service.txt.bak | 3107 | 2014-06-29 |
textclustering-master\data\vocabulary.doc | 33280 | 2014-06-29 |
textclustering-master\devtools | 0 | 2014-06-29 |
textclustering-master\devtools\data | 0 | 2014-06-29 |
textclustering-master\devtools\data\iris.data | 4551 | 2014-06-29 |
textclustering-master\file4.txt | 155 | 2014-06-29 |
textclustering-master\helper.py | 203 | 2014-06-29 |
textclustering-master\iris.csv | 4551 | 2014-06-29 |
textclustering-master\lib | 0 | 2014-06-29 |
textclustering-master\lib\Jama-1.0.2.jar | 32775 | 2014-06-29 |
textclustering-master\lib\ajt-2.9.jar | 139628 | 2014-06-29 |
textclustering-master\lib\commons-math-1.2.jar | 338488 | 2014-06-29 |
textclustering-master\lib\edu.sussex.nlp.jws.beta.11.jar | 65301 | 2014-06-29 |
textclustering-master\lib\edu | 0 | 2014-06-29 |
textclustering-master\lib\edu\sussex | 0 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp | 0 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws | 0 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\AdaptedLesk.java | 13036 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\AdaptedLeskTanimoto.java | 19163 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\AdaptedLeskTanimotoNoHyponyms.java | 18065 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\CompoundWords.java | 2287 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\DepthFinder.java | 14446 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\HirstAndStOnge.java | 26398 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\ICFinder.java | 6943 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\JWS.java | 6528 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\JWSRandom.java | 12735 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\JiangAndConrath.java | 16546 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\LeacockAndChodorow.java | 17104 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\LeskGlossOverlaps.java | 7872 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\Lin.java | 14297 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\Path.java | 14688 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\PathFinder.java | 3668 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\ReadMe(Legal).txt | 276 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\RelatedSynsets.java | 6170 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\Resnik.java | 14358 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\RootFinder.java | 2373 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\TestExamples.java | 3434 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\TestJWSRandom.java | 1158 | 2014-06-29 |
textclustering-master\lib\edu\sussex\nlp\jws\WuAndPalmer.java | 15491 | 2014-06-29 |
textclustering-master\lib\libsvm.jar | 49771 | 2014-06-29 |
textclustering-master\lib\lingpipe-3.9.3.jar | 1177117 | 2014-06-29 |
textclustering-master\lib\simmetrics_jar_v1_6_2_d07_02_07.jar | 133555 | 2014-06-29 |
textclustering-master\lib\weka.jar | 5283274 | 2014-06-29 |
textclustering-master\movement.data | 256293 | 2014-06-29 |
textclustering-master\pom.xml | 1156 | 2014-06-29 |
textclustering-master\result | 0 | 2014-06-29 |
textclustering-master\result\1_long_jiang.txt | 2749 | 2014-06-29 |
textclustering-master\result\1_long_path.txt | 2748 | 2014-06-29 |
textclustering-master\result\1_long_wu.txt | 2744 | 2014-06-29 |
textclustering-master\result\1_short_jiang.txt | 325 | 2014-06-29 |
textclustering-master\result\1_short_path.txt | 302 | 2014-06-29 |
textclustering-master\result\1_short_wu.txt | 325 | 2014-06-29 |
textclustering-master\simiMatrix.txt | 4635 | 2014-06-29 |
textclustering-master\src | 0 | 2014-06-29 |
textclustering-master\src\com | 0 | 2014-06-29 |
textclustering-master\src\com\aliasi | 0 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster | 0 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\AbstractHierarchicalClusterer.java | 6683 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\ClusterScore.java | 24361 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\Clusterer.java | 1844 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\Clustering_own.java | 12997 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\Clustering_own_2.java | 12983 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\CompleteLinkClusterer.java | 10128 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\CopyOfClustering_own.java | 12493 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\Dendrogram.java | 16075 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\HierarchicalClusterer.java | 2006 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\KMeansClusterer.java | 40914 | 2014-06-29 |
textclustering-master\src\com\aliasi\cluster\LatentDirichletAllocation.java | 92784 | 2014-06-29 |