Opinion mining has become a hot topic in recent years. This paper studies the extraction of evaluation objects (opinion targets), one of the sub-tasks of the opinion mining task in the COAE2009 evaluation. We first propose the idea of learning from nuclear sentences and then define ten types of syntactic relations as linguistic features. A Conditional Random Fields (CRF) model is trained on the original sentences and on the nuclear sentences separately, using words, part-of-speech (POS) tags, and syntactic relations, and the results are compared. A second round of CRF learning is then applied to further improve extraction performance. The experiments achieve relatively good extraction results, which shows that the proposed method is feasible and has some practical value.
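To make the feature setup concrete, the following is a minimal sketch of how word, POS, and syntactic-relation features might be assembled per token before being passed to a CRF toolkit. It is an illustration under assumptions, not the authors' implementation: the function names, the context window of one token, the `B-TARGET`/`O` labels, and the relation names in the example are all hypothetical, and the ten relations actually used are not listed here.

```python
from typing import Dict, List, Tuple

# A token is (word, POS tag, syntactic relation to its head).
# The tag and relation names used in the example are placeholders,
# not the ten syntactic relations selected in the paper.
Token = Tuple[str, str, str]


def token_features(sent: List[Token], i: int, use_relation: bool = True) -> Dict[str, str]:
    """Build a feature dictionary for the i-th token of a sentence."""
    word, pos, rel = sent[i]
    feats = {"word": word, "pos": pos}
    if use_relation:
        feats["rel"] = rel
    # Neighbouring tokens give the sequence model some local context.
    if i > 0:
        feats["-1:word"] = sent[i - 1][0]
        feats["-1:pos"] = sent[i - 1][1]
    else:
        feats["BOS"] = "1"  # beginning of sentence
    if i < len(sent) - 1:
        feats["+1:word"] = sent[i + 1][0]
        feats["+1:pos"] = sent[i + 1][1]
    else:
        feats["EOS"] = "1"  # end of sentence
    return feats


def sentence_features(sent: List[Token], use_relation: bool = True) -> List[Dict[str, str]]:
    """Feature dictionaries for every token in the sentence."""
    return [token_features(sent, i, use_relation) for i in range(len(sent))]


# Hypothetical annotated sentence: "the screen is very clear".
example: List[Token] = [("屏幕", "n", "SBV"), ("很", "d", "ADV"), ("清晰", "a", "HED")]
labels = ["B-TARGET", "O", "O"]  # assumed labelling scheme for evaluation objects

features = sentence_features(example)
```

Calling `sentence_features` with `use_relation=False` gives the word-and-POS-only variant, so the same routine could be applied to both original and nuclear sentences when comparing feature sets.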