文件名称:ChineseSegment
- 所属分类:
- 人工智能/神经网络/遗传算法
- 资源属性:
- [Java] [源码]
- 上传时间:
- 2013-01-06
- 文件大小:
- 13.91mb
- 下载次数:
- 0次
- 提 供 者:
- 张**
- 相关连接:
- 无
- 下载说明:
- 别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容均来自于网络,请自行研究使用
一个完整的中文分词程序,有源码,词典,训练集。算法简洁高效,准确率高。包含了一种将标注语料和词典融合的新型分词方法。将语料分割为2:1为训练集和测试集,加上一个外部词典,准确率可以达到95 。适合入门者学习。也适合需要一个简单分词工具的应用。-A Chinese word segmentation procedures, source, dictionary, the training set. The algorithm is simple and efficient, high accuracy. The label contains a new segmentation method of integration of corpus and dictionaries. Corpus split 2:1 for the training set and a test set, plus an external dictionary, the accuracy rate can reach 95 . Suitable for beginners to learn. Also suitable for a simple wordsegmentation application.
(系统自动生成,下载前可以参看下载内容)
下载文件列表
ChineseSegment\.classpath
..............\.fatjar
..............\.project
..............\.settings\org.eclipse.core.resources.prefs
..............\bin\org\tseg\seg\199801q.txt
..............\...\...\....\...\BigramSeg.class
..............\...\...\....\...\BiWordGraph.class
..............\...\...\....\...\biWordRate.out
..............\...\...\....\...\Count.class
..............\...\...\....\...\CountBiGram.class
..............\...\...\....\...\MergeNamedEntity.class
..............\...\...\....\...\SegModel.class
..............\...\...\....\...\SplitSentenceTest.class
..............\...\...\....\...\SplitSentence_seg.class
..............\...\...\....\...\UnigramSeg.class
..............\...\...\....\...\UnigramSegTest.class
..............\...\...\....\...\wordFrequence.out
..............\...\...\....\...\WordGraph.class
..............\...\...\....\...\wordRate.out
..............\...\...\....\...\词典.txt
..............\src\org\tseg\seg\199801q.txt
..............\...\...\....\...\BigramSeg.java
..............\...\...\....\...\BiWordGraph.java
..............\...\...\....\...\biWordRate.out
..............\...\...\....\...\Count.java
..............\...\...\....\...\CountBiGram.java
..............\...\...\....\...\MergeNamedEntity.java
..............\...\...\....\...\SegModel.java
..............\...\...\....\...\SplitSentenceTest.java
..............\...\...\....\...\SplitSentence_seg.java
..............\...\...\....\...\UnigramSeg.java
..............\...\...\....\...\UnigramSegTest.java
..............\...\...\....\...\wordFrequence.out
..............\...\...\....\...\WordGraph.java
..............\...\...\....\...\wordRate.out
..............\...\...\....\...\词典.txt
..............\bin\org\tseg\seg
..............\src\org\tseg\seg
..............\bin\org\tseg
..............\src\org\tseg
..............\bin\org
..............\src\org
..............\.settings
..............\bin
..............\lib
..............\src
ChineseSegment