Search results: resource list
pugxml_src
- This XML parser segments a given string in situ (like strtok), performing scanning/tokenization and parsing in a single pass.
strtk (identifies tokens in text files)
- Identifies tokens in a text file.
pugxml_src
- This XML parser segments a given string in situ (like strtok), performing scanning/tokenization and parsing in a single pass.
Create_token
- Tokenization. Creates terms (words) from multiple documents stored in one txt file. Produces two outputs: DICTIONARY.txt contains the term-related data and POSTING.txt contains the term descriptions.
JTextPro-1.0.tar
- JTextPro: a Java-based text processing tool that includes sentence boundary detection (using a maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phra
Perl
- Perl implementation of data classification: tokenization, feature selection, and document classification. The project's goal is to provide an application that produces a brief list for a set of books in XML format then maybe people can through t
vtd-xml-2.6-java-src
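- VTD-XML is a new open-source, Java-based XML processing API that addresses many of the problems with current XML processing models. The project is currently hosted on Sourceforge and can be found here. This demonstration will familiarize you with the basic concepts. On that point alone we cannot yet conclude that VTD-XML was designed specifically for this purpose, because it introduces a large number of optimization techniques starting from the very first step, tokenization. For XML files that don't declare e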