文本情感倾向分析是意见挖掘和情感文摘中的一个重要环节,而在情感倾向分析中涉及到的是主观性文本,这就需要进行主客观文本分类。当前的主客观文本分类方法主要是基于特征词典的概率统计方法,并没有考虑特征之间的语法与语义关系。针对该问题,该文提出一种基于隐马尔可夫模型(HMM)的主观句识别方法。该方法首先从训练语料中抽取具有明显分类效果的七类主客观特征,然后每个句子应用HMM进行特征角色类别标注,并依据标注的结果计算句子的权重,最终识别主观句。该方法在第六届中文倾向性分析评测任务中能够有效地识别主观句。
The current subjective and objective text classification methods are mainly based on statistical model over the feature lexicon,which didn't take into account the syntax and semantic relationships between features.The paper proposes a Chinese subjective sentence recognition based on Hidden Markov Model.In this method,seven kinds of subjective and objective features for classification are extracted tagged among each sentence by HMM.The subjective sentences are decided by the importance of features and syntactic structure of sentences.The method is examined in the task of COAE2014 for its effeiciency.