文件名称:SearchEngine
介绍说明--下载内容均来自于网络,请自行研究使用
dySE 是个开源的 Java 小型搜索引擎。该搜索引擎分为三个模块:爬虫模块、预处理模块和搜索模块。其中详细阐述了: 多线程页面爬取、正文内容提取、文本提取、分词、索引建立、快照等功能的实现。-dySE is an open source Java small search engines. The search engine is divided into three modules: crawler module, pretreatment module and search module. Which elaborated: Multithreaded page crawling, text content extraction, text extraction, segmentation, indexing, snapshots and other functions.
(系统自动生成,下载前可以参看下载内容)
下载文件列表
SearchEngine
............\.classpath
............\.mymetadata
............\.project
............\.settings
............\.........\.jsdtscope
............\.........\com.genuitec.eclipse.migration.prefs
............\.........\org.eclipse.jdt.core.prefs
............\.........\org.eclipse.wst.jsdt.ui.superType.container
............\.........\org.eclipse.wst.jsdt.ui.superType.name
............\Dictionary
............\..........\stopWord.txt
............\..........\wordlist.txt
............\lib
............\...\mysql-connector-java-5.1.7-bin.jar
............\Raws
............\....\RAW__0.txt
............\....\RAW__1.txt
............\....\RAW__2.txt
............\....\RAW__3.txt
............\....\RAW__4.txt
............\SearchEngine.war
............\src
............\...\configure
............\...\configure.properties
............\...\.........\Configuration.java
............\...\core
............\...\....\preprocess
............\...\....\..........\DictReader.java
............\...\....\..........\DictSegment.java
............\...\....\..........\forwardIndex
............\...\....\..........\............\ForwardIndex.java
............\...\....\..........\index
............\...\....\..........\.....\originalPageGetter.java
............\...\....\..........\.....\RawsAnalyzer.java
............\...\....\..........\invertedIndex
............\...\....\..........\.............\InvertedIndex.java
............\...\....\query
............\...\....\.....\Response.java
............\...\....\spider
............\...\....\......\Dispatcher.java
............\...\....\......\Gather.java
............\...\....\......\Spider.java
............\...\....\......\URLClient.java
............\...\....\......\WebAnalyzer.java
............\...\....\util
............\...\....\....\DBConnection.java
............\...\....\....\HtmlParser.java
............\...\....\....\MD5.java
............\...\....\....\Page.java
............\...\....\....\Result.java
............\...\....\....\ResultGenerator.java
............\...\....\....\StopWordsMerger.java
............\...\META-INF
............\...\........\MANIFEST.MF
............\...\test
............\...\....\testDBConnection.java
............\...\....\testDictSegment.java
............\...\....\testMySql.java
............\...\....\testOffset.java
............\...\....\testParseHtml.java
............\...\....\testRawsAnalyzer.java
............\...\....\testSougouOffset.java
............\...\....\testStringTokenizer.java
............\...\....\testSubFile.java
............\...\....\testSubString.java
............\WebRoot
............\.......\dySE-logo.jpg
............\.......\index.jsp
............\.......\META-INF
............\.......\........\MANIFEST.MF
............\.......\search.jsp
............\.......\Thumbs.db
............\.......\WEB-INF
............\.......\.......\classes
............\.......\.......\.......\configure
............\.......\.......\.......\configure.properties
............\.......\.......\.......\.........\Configuration.class
............\.......\.......\.......\core
............\.......\.......\.......\....\preprocess
............\.......\.......\.......\....\..........\DictReader.class
............\.......\.......\.......\....\..........\DictSegment.class
............\.......\.......\.......\....\..........\forwardIndex
............\.......\.......\.......\....\..........\............\ForwardIndex.class
............\.......\.......\.......\....\..........\index
............\.......\.......\.......\....\..........\.....\originalPageGetter.class
............\.......\.......\.......\....\..........\.....\RawsAnalyzer.class
............\.......\.......\.......\....\..........\invertedIndex
............\.......\.......\.......\....\..........\.............\InvertedIndex.class
............\.......\.......\.......\....\query
............\.......\.......\.......\....\.....\Response.class
............\.......\.......\.......\....\spider
............\.......\.......\.......\....\......\Dispatcher.class
............\.......\.......\.......\....\......\Gather.class
............\.......\.......\......