文件名称:CosineSimilarAlgorithmzf
介绍说明--下载内容均来自于网络,请自行研究使用
这里会用到TF/IDF权重,用余弦夹角计算文本相似度,用方差计算两个数据间欧式距离,用k-means进行数据聚类等数学和统计知识。-Here will use the TF/IDF weight, with cosine angle calculation of text similarity, with the variance of the two data between the data of the European distance, with K-means data clustering and other mathematical and statistical knowledge.
(系统自动生成,下载前可以参看下载内容)
下载文件列表
CosineSimilarAlgorithmzf
........................\.classpath
........................\.gitignore
........................\.project
........................\.settings
........................\.........\org.eclipse.core.resources.prefs
........................\.........\org.eclipse.jdt.core.prefs
........................\README.md
........................\lib
........................\...\ik2012lucene4.jar
........................\src
........................\...\com
........................\...\...\xq
........................\...\...\..\algorithm
........................\...\...\..\.........\CosineSimilarAlgorithm.java
........................\...\...\..\.........\SimpleSummariserAlgorithm.java
........................\...\...\..\.........\SpellCorrect.java
........................\...\...\..\.........\TfIdfAlgorithm.java
........................\...\...\..\.........\WordTrie.java
........................\...\...\..\.........\WordTrieMain.java
........................\...\...\..\.........\big.txt
........................\...\ik
........................\...\..\IKAnalyzer.cfg.xml
........................\...\..\ext.dic
........................\...\..\main2012.dic
........................\...\..\quantifier.dic
........................\...\..\stopword.dic