Search results
Word segmentation program code
- Source code of a word segmentation program.
Paoding word segmentation tool
- A popular Java word segmentation program.
Word segmentation program code
- Source code of a word segmentation program.
MFC dictionary lookup, word segmentation, and word frequency program
- An MFC program that looks up words in a dictionary (users can import their own text), segments words, counts word frequencies, and can save the results. It was the final assignment for our MFC course; highly recommended.
Baidu word segmentation lexicon
- Reportedly the Chinese word segmentation dictionary that Baidu used in the past. Hope it is of some help; download it while you can.
segment
- A text-file-based word segmentation program that automatically splits the text document with the specified file name into words according to a dictionary.
framework
- A Chinese word segmentation program based on dynamic programming, written in VC (Visual C++) and easy to extend.
SegtoFile
- A Chinese word segmentation program for natural language processing; it can write the segmentation output to a file.
WordClassify
- A word segmentation program written in C, with very detailed comments that make it easy to read.
zhongwenfenci
- A PDF document describing a Chinese word segmentation program for information retrieval.
ICTCLAS
- The word segmentation program from the Chinese Academy of Sciences. It produces fairly satisfactory segmentation results with relatively high accuracy.
WordSeg
- A Chinese word segmentation program. The user opens a Chinese text file (.txt) and clicks the segmentation button to see the result. Open source.
fenci
- A Java word segmentation program that can flexibly generate and add dictionaries.
autosplit
- An automatic Chinese classification and word segmentation program, packaged as a DLL, with documentation included.
fenci
- A simple word segmentation program that includes source code and a lexicon. After compiling and linking, run it from the command line.
windows_c_32
- A Chinese word segmentation program developed by the Chinese Academy of Sciences, used for segmenting Chinese text (ictat).
Segmentation
- A Chinese word segmentation program implemented with an HMM, written in C#.
WordSegment
- A Chinese word segmentation program based on string matching, C++ version. Results are output to a file.
mmseg
- A word segmentation program based on a double-array trie, with a segmentation speed of 20 MB/s. Supports GBK and UTF-8 encodings.
WordPartation2
- A Chinese word segmentation program that uses the maximum matching algorithm. Supports files in GB2312 encoding.
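
For readers unfamiliar with the maximum matching approach named in the last entry, a rough sketch of forward maximum matching follows. It is an illustration in Python, not the code of WordPartation2 or of any package listed here. The function name `forward_max_match`, the sample dictionary, and the single-character fallback for unknown text are all assumptions made for the example.

```python
def forward_max_match(text, dictionary, max_len=None):
    """Greedy forward maximum matching (illustrative sketch only).

    At each position, take the longest dictionary word that starts there.
    If nothing matches, emit a single character (an assumed fallback).
    """
    if max_len is None:
        # Longest dictionary entry bounds the window; default to 1 if empty.
        max_len = max((len(word) for word in dictionary), default=1)
    max_len = max(1, max_len)  # guard against a zero or negative window

    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until something matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                words.append(candidate)
                i += size
                break
    return words


if __name__ == "__main__":
    sample_dictionary = {"中文", "分词", "程序", "中文分词"}  # hypothetical lexicon
    print(forward_max_match("中文分词程序", sample_dictionary))
    # prints ['中文分词', '程序']
```

Because the method is greedy, it can mis-segment ambiguous strings; a dictionary-based tool would typically read its lexicon from a file and decode the input (for example from GB2312) before calling a routine like this.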