File name: chentian.nutch
Description (all downloadable content comes from the internet; please study and use it at your own discretion)
Implements dictionary-based Chinese word segmentation for Nutch, mainly by modifying the .jj grammar file (NutchAnalysis.jj) and related files.
(Generated automatically by the system; you can review the download contents before downloading.)
Downloaded file list
chentian.nutch
..............\AnalyzerFactory.java
..............\build.xml
..............\CharStream.java
..............\CJKTokenizer.java
..............\com
..............\...\xjt
..............\...\...\nlp
..............\...\...\...\word
..............\...\...\...\....\ICTCLAS.java
..............\...\...\...\....\Sentence.java
..............\...\...\...\....\SplitWord.java
..............\...\...\...\....\ThreadTest.java
..............\...\...\...\....\Word.java
..............\...\...\...\....\Word.jbx
..............\CommonGrams.java
..............\FastCharStream.java
..............\NutchAnalysis.java
..............\NutchAnalysis.jj
..............\NutchAnalysisConstants.java
..............\NutchAnalysisTokenManager.java
..............\NutchAnalyzer.java
..............\NutchDocumentAnalyzer.java
..............\NutchDocumentTokenizer.java
..............\ParseException.java
..............\Token.java
..............\TokenManager.java
..............\TokenMgrError.java