Search resource list
SegmentRAM
- 1. A high-performance index data access interface based on the Lucene inverted index format. 2. Implementations of several Chinese word segmentation algorithms.
invert10_31
- Chinese information processing: builds an inverted table for the files and forms a linked list of term frequencies.
srcfileread_10_31
- File operations: performs the corresponding operations on the files listed in the inverted index table.
invertefile(chinese)
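- First builds an inverted file for the Chinese documents, then searches the documents with this retrieval algorithm using the inverted file that was built.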
invertefile(english)
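- First builds an inverted file for the English documents, then searches the documents with this retrieval algorithm using the inverted file that was built.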
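
The two-step flow described in the entries above (build an inverted file, then search against it) can be sketched as follows. This is a minimal illustration and not the code from either package: the function names `build_inverted_index` and `search`, the whitespace tokenizer, and the AND query semantics are all assumptions made for the example.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercased term to the sorted list of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        # Assumption: whitespace tokenization, which only suits English-like text.
        for term in text.lower().split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


def search(index, query):
    """Return ids of documents that contain every query term (AND semantics)."""
    terms = query.lower().split()
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))
    return sorted(result)


# Hypothetical sample documents keyed by document id.
docs = {
    1: "the quick brown fox",
    2: "the lazy dog",
    3: "quick brown dogs",
}
index = build_inverted_index(docs)
print(search(index, "quick brown"))
```

For Chinese documents the same structure would apply, but the whitespace split would have to be replaced by a word segmenter.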
TextClassification_wbfl_sn
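- The whole experiment was carried out in Delphi under Windows. 600 documents were selected, and the data set is divided into 5 categories: Education, Business and Economy, Computers and Internet, Entertainment and Leisure, and Natural Science. Education contains 31 documents, Business and Economy 93, Computers and Internet 102, Entertainment and Leisure 166, and Natural Science 208. Directory "DataSet": the texts in RawText are saved in the DataSet directory after word segmentation. Data table "WordsTable": ...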
Inverted index implementation
- An ultra-efficient algorithm for implementing an inverted index
Approximate search with an inverted index
- Performs approximate search using an inverted index, based on the ED value...
InverseIndex
- Implements a file inverted index with a B+ tree; highlights the matched search keywords and sorts results by frequency of occurrence.
hlink.031023-1010.tar
- The index-building program of a search engine, written in C++; implements an inverted index.
IndexDemo
- A very simple example of building an inverted table, intended for learning. Uses student information as the sample data.
inverted_index
- A simple inverted file implementation, one of the steps in building a search engine. Makes heavy use of the STL; the implementation is simple and easy to understand. Performance is average.
ir
- This system implements word segmentation and an inverted index; segmentation uses forward maximum matching.
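
Forward maximum matching, the segmentation method named in the entry above, scans the text left to right and at each position takes the longest dictionary word that matches. The sketch below is a generic illustration of that technique, not the code from the `ir` package; the function name `fmm_segment`, the tiny dictionary, and the single-character fallback for unknown text are assumptions.

```python
def fmm_segment(text, dictionary, max_len=None):
    """Segment text by forward maximum matching against a set of known words."""
    if max_len is None:
        # Longest dictionary word bounds the candidate window.
        max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a dictionary word matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # Assumption: an unmatched single character is emitted as its own token.
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Hypothetical dictionary for illustration only.
dictionary = {"研究", "研究生", "生命", "起源"}
print(fmm_segment("研究生命起源", dictionary))
```

With this dictionary the greedy match picks "研究生" first, which leaves "命" stranded as a single character. That is the well-known weakness of forward maximum matching on ambiguous text.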
VSM
- Vector space model algorithm. Given a word-segmented document set, it can output the vector space model, a feature dictionary, an inverted index table, and so on. Classic VSM algorithm source code.
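
As a rough illustration of what a vector space model over a segmented document set looks like, the sketch below builds a feature dictionary and one term-frequency vector per document, then compares documents by cosine similarity. It is not the code from the `VSM` package: the names `build_vsm` and `cosine`, and the choice of raw term frequency as the weight (rather than, say, TF-IDF), are assumptions.

```python
import math
from collections import Counter


def build_vsm(segmented_docs):
    """Return (feature dictionary, term-frequency vectors) for already segmented docs."""
    vocab = sorted({term for doc in segmented_docs for term in doc})
    vectors = []
    for doc in segmented_docs:
        counts = Counter(doc)
        # One dimension per vocabulary term; weight is the raw term frequency.
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical, already segmented documents.
segmented_docs = [
    ["inverted", "index", "search"],
    ["index", "search", "engine"],
    ["word", "segmentation"],
]
vocab, vectors = build_vsm(segmented_docs)
print(vocab)
print(cosine(vectors[0], vectors[1]))
```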
EasyXSpider
- A sample program for Linux featuring a crawler, an inverted index, multi-condition retrieval, bigram word segmentation, and the Google PageRank algorithm. Includes a CGI query interface. Cool!
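
Bigram word segmentation, mentioned in the entry above, needs no dictionary: it simply emits every pair of adjacent characters as a token. The sketch below shows the general idea only and is not taken from EasyXSpider; the function name `bigram_segment` and the handling of inputs shorter than two characters are assumptions.

```python
def bigram_segment(text):
    """Split text into overlapping two-character tokens."""
    if len(text) < 2:
        # Assumption: a lone character is kept as a single token; empty input gives no tokens.
        return [text] if text else []
    return [text[i:i + 2] for i in range(len(text) - 1)]


print(bigram_segment("搜索引擎"))
```

The overlapping tokens make the index larger than a dictionary-based one, but any two-character query substring can be matched without knowing the vocabulary in advance.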
1
- A prototype program for building an inverted index. Mainly uses a linked-list data structure.
irCode
- Inverted index implementation: provides search over a document collection by means of an inverted index.