File name: IKAnalyzer3.2.8-source
- Category: JSP source code/Java
- Resource attributes: [Java] [Source code]
- Upload date: 2012-11-26
- File size: 1.14 MB
- Downloads: 0
- Provider: Chen**
- Related links: None
Description (all downloadable content is collected from the internet; please evaluate it yourself before use)
Source package for IKAnalyzer, which implements Chinese word segmentation:
(1) Uses a distinctive "forward-iteration, finest-grained segmentation" algorithm, with a stated processing speed of 600,000 characters per second.
(2) Uses a multi-subprocessor analysis mode that supports segmentation of English letters (IP addresses, email addresses, URLs), numbers (dates, common Chinese quantifiers, Roman numerals, scientific notation), and Chinese vocabulary (personal names and place names).
(3) Optimized dictionary storage with a smaller memory footprint. Supports user-defined dictionary extensions.
(4) Includes IKQueryParser, a query analyzer optimized for Lucene full-text search. It uses an ambiguity-analysis algorithm to optimize the permutations and combinations of query keywords, which can substantially improve Lucene's hit rate.
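To make item (1) more concrete, here is a minimal Java sketch of what forward, finest-grained dictionary segmentation can look like. It is an illustration under stated assumptions, not the IKAnalyzer implementation: the class `FinestGrainSketch` and its methods `addWord` and `segment` are hypothetical names, the dictionary is a plain `HashSet` rather than the package's optimized dictionary storage, and characters outside the Basic Multilingual Plane are not handled. The `addWord` method stands in loosely for the user dictionary extension mentioned in item (3).

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/**
 * Illustrative sketch only: a forward, finest-grained dictionary segmenter.
 * NOT the IKAnalyzer implementation; all names here are hypothetical.
 */
class FinestGrainSketch {
    private final Set<String> dictionary = new HashSet<>();
    private int maxWordLength = 1;

    /** Adds a word to the dictionary (stand-in for a user dictionary entry). */
    void addWord(String word) {
        if (word == null || word.isEmpty()) {
            return;
        }
        dictionary.add(word);
        maxWordLength = Math.max(maxWordLength, word.length());
    }

    /**
     * Scans start positions from left to right and, at each position, emits
     * every dictionary word that begins there. A character that is not
     * covered by any dictionary match is emitted as a single-character token.
     */
    List<String> segment(String text) {
        List<String> tokens = new ArrayList<>();
        int coveredUpTo = 0; // exclusive end of the furthest match seen so far
        for (int start = 0; start < text.length(); start++) {
            boolean matched = false;
            int limit = Math.min(text.length(), start + maxWordLength);
            for (int end = start + 1; end <= limit; end++) {
                String candidate = text.substring(start, end);
                if (dictionary.contains(candidate)) {
                    tokens.add(candidate);
                    coveredUpTo = Math.max(coveredUpTo, end);
                    matched = true;
                }
            }
            if (!matched && start >= coveredUpTo) {
                tokens.add(text.substring(start, start + 1));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        FinestGrainSketch segmenter = new FinestGrainSketch();
        for (String w : new String[] {"中华", "华人", "人民", "共和", "共和国", "中华人民共和国"}) {
            segmenter.addWord(w);
        }
        // Prints: [中华, 中华人民共和国, 华人, 人民, 共和, 共和国]
        System.out.println(segmenter.segment("中华人民共和国"));
    }
}
```

The finest-grained output deliberately contains overlapping tokens (for example both 中华 and 华人), which favors recall when indexing. A query-side component such as the IKQueryParser described in item (4) would then have to resolve that ambiguity; this sketch does not attempt it.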
(Automatically generated by the system; you can review the package contents before downloading)
Files in the download
IKAnalyzer3.2.8 source
......................\src
......................\...\ext_stopword.dic
......................\...\IKAnalyzer.cfg.xml
......................\...\org
......................\...\...\wltea
......................\...\...\.....\analyzer
......................\...\...\.....\........\cfg
......................\...\...\.....\........\...\Configuration.java
......................\...\...\.....\........\Context.java
......................\...\...\.....\........\dic
......................\...\...\.....\........\...\Dictionary.java
......................\...\...\.....\........\...\DictSegment.java
......................\...\...\.....\........\...\Hit.java
......................\...\...\.....\........\...\main.dic
......................\...\...\.....\........\...\preposition.dic
......................\...\...\.....\........\...\quantifier.dic
......................\...\...\.....\........\...\stopword.dic
......................\...\...\.....\........\...\suffix.dic
......................\...\...\.....\........\...\surname.dic
......................\...\...\.....\........\help
......................\...\...\.....\........\....\CharacterHelper.java
......................\...\...\.....\........\IKSegmentation.java
......................\...\...\.....\........\Lexeme.java
......................\...\...\.....\........\lucene
......................\...\...\.....\........\......\IKAnalyzer.java
......................\...\...\.....\........\......\IKQueryParser.java
......................\...\...\.....\........\......\IKSimilarity.java
......................\...\...\.....\........\......\IKTokenizer.java
......................\...\...\.....\........\sample
......................\...\...\.....\........\......\IKAnalyzerDemo.java
......................\...\...\.....\........\seg
......................\...\...\.....\........\...\CJKSegmenter.java
......................\...\...\.....\........\...\ISegmenter.java
......................\...\...\.....\........\...\LetterSegmenter.java
......................\...\...\.....\........\...\QuantifierSegmenter.java
......................\...\...\.....\........\solr
......................\...\...\.....\........\....\IKTokenizerFactory.java
......................\test
......................\....\CH_stopword.dic
......................\....\mydict.dic
......................\....\org
......................\....\...\wltea
......................\....\...\.....\analyzer
......................\....\...\.....\........\test
......................\....\...\.....\........\....\CfgTester.java
......................\....\...\.....\........\....\CharacterTest.java
......................\....\...\.....\........\....\DictionaryTester.java
......................\....\...\.....\........\....\IKTokenerTest.java
......................\....\...\.....\........\....\NumberSegmenter.java
......................\....\...\.....\........\....\SegmentorTester.java
......................\....\...\.....\........\....\SimpleQuantifierSegmenter.java
......................\....\...\.....\........\....\StandardAnalyzerTest.java