文件名称:TextAnalysis
介绍说明--下载内容均来自于网络,请自行研究使用
TextAnalysis系统及算法设计
输入为ICTCLAS分词后的词语结构信息,对每个词语的词性进行判断。
1. 如果不存在词性,则跳过这次循环。用来跳过一些语气助词等无意义的信息。
2. 由于每个句子都有几个子句,而每个子句都是一个独立的主谓宾结构,所以系统将子句通过标点符号来分隔。最后将所以子句的总情感权值相加得到总句的情感权值。
3. 在对字典的预处理阶段,系统对不同程度的词语赋予了不同的权值。为了提高处理程序的效率,系统只分析对体现语言情感有较大作用的词性(包括形容词、副词、动词、名词、数词)。
4. 对于副词,需要特殊处理。首先副词是加强语气的作用,如“非常好”,“非常糟糕”。此时句子的情感权值就需要用到副词乘以原来的权值。另外,如“非常非常的不好”,这是就需要用副词来乘以副词了。对应函数sentenceAnalysis。
5. 对于字典中词语权值的说明。对于否定词语,系统设置为-1,即与原来的权值相反,这样也满足双重或多重否定的要求。对于不同的程度词语,对应的分为6个层次,分别赋予不同的权值,以表示不同语气的情感权值的强弱。对于褒义词和贬义词,系统简单的赋予1和-1的权值。对应函数sentenceAnalysis。
-Enter the configuration information for the word after word ICTCLAS , judge for the part of speech of each word .
1 If there is no speech , skip this cycle. Used to skip some of the modal particle and other meaningless information .
2 Since each sentence has several clauses , each clause is a separate subject-verb-object structure , so the system will be separated by punctuation clause . Finally, it is the emotional weight of the total obtained by adding the clause emotional value of the total sentence .
3 in the dictionary for the pretreatment phase, the system for different levels of words given a different weight . In order to improve the efficiency of the process , the system only analyzes the emotional language to reflect the greater role of speech ( including adjectives, adverbs , verbs , nouns , numerals ) .
4 For adverbs , require special handling. First, the adverb is to strengthen the role of tone , such as " very good" , "very bad ." At this point the emotional weight of
输入为ICTCLAS分词后的词语结构信息,对每个词语的词性进行判断。
1. 如果不存在词性,则跳过这次循环。用来跳过一些语气助词等无意义的信息。
2. 由于每个句子都有几个子句,而每个子句都是一个独立的主谓宾结构,所以系统将子句通过标点符号来分隔。最后将所以子句的总情感权值相加得到总句的情感权值。
3. 在对字典的预处理阶段,系统对不同程度的词语赋予了不同的权值。为了提高处理程序的效率,系统只分析对体现语言情感有较大作用的词性(包括形容词、副词、动词、名词、数词)。
4. 对于副词,需要特殊处理。首先副词是加强语气的作用,如“非常好”,“非常糟糕”。此时句子的情感权值就需要用到副词乘以原来的权值。另外,如“非常非常的不好”,这是就需要用副词来乘以副词了。对应函数sentenceAnalysis。
5. 对于字典中词语权值的说明。对于否定词语,系统设置为-1,即与原来的权值相反,这样也满足双重或多重否定的要求。对于不同的程度词语,对应的分为6个层次,分别赋予不同的权值,以表示不同语气的情感权值的强弱。对于褒义词和贬义词,系统简单的赋予1和-1的权值。对应函数sentenceAnalysis。
-Enter the configuration information for the word after word ICTCLAS , judge for the part of speech of each word .
1 If there is no speech , skip this cycle. Used to skip some of the modal particle and other meaningless information .
2 Since each sentence has several clauses , each clause is a separate subject-verb-object structure , so the system will be separated by punctuation clause . Finally, it is the emotional weight of the total obtained by adding the clause emotional value of the total sentence .
3 in the dictionary for the pretreatment phase, the system for different levels of words given a different weight . In order to improve the efficiency of the process , the system only analyzes the emotional language to reflect the greater role of speech ( including adjectives, adverbs , verbs , nouns , numerals ) .
4 For adverbs , require special handling. First, the adverb is to strengthen the role of tone , such as " very good" , "very bad ." At this point the emotional weight of
(系统自动生成,下载前可以参看下载内容)
下载文件列表
TextAnalysis\Debug\TextAnalysis.exe
............\.....\TextAnalysis.exp
............\.....\TextAnalysis.ilk
............\.....\TextAnalysis.lib
............\.....\TextAnalysis.pdb
............\ipch\textanalysis-511e494c\textanalysis-c51ddaf5.ipch
............\TextAnalysis\20131121.err
............\............\configure.xml
............\............\Data\BIG2GBK.map
............\............\....\BIG5.pdat
............\............\....\BIG5.wordlist
............\............\....\BiWord.big
............\............\....\charset.type
............\............\....\Configure.xml
............\............\....\CoreDict.pdat
............\............\....\CoreDict.pos
............\............\....\CoreDict.unig
............\............\....\FieldDict.pdat
............\............\....\FieldDict.pos
............\............\....\GBK.pdat
............\............\....\GBK.wordlist
............\............\....\GBK2BIG.map
............\............\....\GBK2GBKC.map
............\............\....\GBK2UTF.map
............\............\....\GBKA.pdat
............\............\....\GBKA.wordlist
............\............\....\GBKA2UTF.map
............\............\....\GBKC.pdat
............\............\....\GBKC.wordlist
............\............\....\GBKC2GBK.map
............\............\....\GranDict.pdat
............\............\....\GranDict.pos
............\............\....\ICTPOS.map
............\............\....\NewWord.lst
............\............\....\NLPIR.ctx
............\............\....\NLPIR.user
............\............\....\NLPIR_First.map
............\............\....\nr.ctx
............\............\....\nr.fsa
............\............\....\nr.role
............\............\....\PKU.map
............\............\....\PKU_First.map
............\............\....\UserDict.pdat
............\............\....\UTF2GBK.map
............\............\....\UTF2GBKA.map
............\............\....\UTF8.pdat
............\............\....\UTF8.wordlist
............\............\.ebug\CL.read.1.tlog
............\............\.....\CL.write.1.tlog
............\............\.....\link-cvtres.read.1.tlog
............\............\.....\link-cvtres.write.1.tlog
............\............\.....\link.read.1.tlog
............\............\.....\link.write.1.tlog
............\............\.....\mt.read.1.tlog
............\............\.....\mt.write.1.tlog
............\............\.....\rc.read.1.tlog
............\............\.....\rc.write.1.tlog
............\............\.....\stdafx.obj
............\............\.....\TextAnalysis.Build.CppClean.log
............\............\.....\TextAnalysis.exe.embed.manifest
............\............\.....\TextAnalysis.exe.embed.manifest.res
............\............\.....\TextAnalysis.exe.intermediate.manifest
............\............\.....\TextAnalysis.lastbuildstate
............\............\.....\TextAnalysis.log
............\............\.....\TextAnalysis.obj
............\............\.....\TextAnalysis.pch
............\............\.....\TextAnalysis.write.1.tlog
............\............\.....\TextAnalysis_manifest.rc
............\............\.....\vc100.idb
............\............\.....\vc100.pdb
............\............\dic\bad.txt
............\............\...\good.txt
............\............\...\level1.txt
............\............\...\level2.txt
............\............\...\level3.txt
............\............\...\level4.txt
............\............\...\level5.txt
............\............\...\level6.txt
............\............\...\not.txt
............\............\libmysql.dll
............\............\libmysql.lib
............\............\NLPIR.dll
............\............\NLPIR.h
............\............\NLPIR.lib
............\............\ReadMe.txt
............\............\stdafx.cpp
............\............\stdafx.h
............\............\targetver.h
............\............\TextAnalysis.cpp
............\............\TextAnalysis.vcxproj
............\............\TextAnalysis.vcxproj.filters
............\............\TextAnalys