特征—观点对的抽取是观点挖掘中非常重要的研究课题之一。该文首先利用依存语法对句子进行了依存分析,在此基础上研究了旅游评论文本中特征-观点对的抽取。利用词对间的依存关系,构建了获取含有特征和观点词语的组块规则,并设计了候选特征的识别算法和特征—观点对的抽取算法。该文对山西旅游景点评论语料进行了实验,结果表明,特征—观点对的抽取整体的F1值达到了87.10%,验证了方法的有效性。
Feature-Opinion Extraction is one of the key researches in the area of opinion mining,bearing significant affect on the performance of opinion orientation identification.This paper proposes an approach to mining evaluation features and opinions based on the dependency information and the chunk information.With the dependency relation between word and word,we construct the rules to obtain chunks containing the evaluation feature and opinion and further design three algorithms to get the candidate evaluation features and candidate feature-opinion pairs.Experimental results show that the whole F1-measure can achieve 87.10% in scenic spots reviews of Shanxi,proving effectiveness of the proposed method.