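  • ISSN: 1000-1239
  • Journal: Journal of Computer Research and Development (计算机研究与发展)
  • Date: 2012.11.11
  • Pages: 2376-2382
  • Classification: TP391.1 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
  • Author affiliations: [1] Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; [2] University of Chinese Academy of Sciences, Beijing 100049
  • Funding: Key Program of the National Natural Science Foundation of China (60933005); National High Technology Research and Development Program of China ("863" Program) (2010AA012500); National Natural Science Foundation of China (60803085)
  • Related project: New Theories and Methods for Web Search and Mining: Research on Theories and Methods of Web Search and Mining in Support of Public Opinion Monitoring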



A key problem in sentiment analysis is determining whether the polarity of a review is positive (thumbs up) or negative (thumbs down). Unlike topic-based text classification, where high accuracy can be achieved, sentiment classification is a hard and complicated task. One of the main challenges for document-level sentiment classification is that not every part of a document is equally informative for inferring the polarity of the whole document. Distinguishing key sentences from trivial sentences therefore helps improve sentiment classification performance. We divide a document into key sentences and detailed sentences. Key sentences are usually brief but discriminative, while detailed sentences are diverse and ambiguous. For key sentence extraction, our approach takes three attributes into account: a sentiment attribute, a position attribute, and a special-words attribute. To exploit the discrepancy and complementarity between key sentences and detailed sentences, we incorporate both in supervised and semi-supervised learning. In supervised sentiment classification, a classifier combination approach is adopted because the original document is divided into two different and complementary parts; in semi-supervised sentiment classification, a co-training algorithm is proposed to incorporate unlabeled data. Experimental results across eight domains show that our method performs better than the baseline and that the key sentence extraction is effective.
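
The abstract names three attributes for key sentence extraction but does not give the scoring formula, so the following is only a minimal sketch of how such a split might look, not the authors' method. The word lists, the equal weights, the "start or end of the document scores higher" position heuristic, and all function names are assumptions made for illustration.

```python
import re

# Hypothetical toy lexicons; the resources used in the paper are not given in the abstract.
POSITIVE = {"good", "great", "excellent", "love", "recommend"}
NEGATIVE = {"bad", "poor", "terrible", "hate", "disappointing"}
SUMMARY_CUES = {"overall", "summary", "conclusion"}  # assumed "special words"


def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokens(sentence):
    """Lowercase word tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def sentence_score(sentence, index, total, weights=(1.0, 1.0, 1.0)):
    """Weighted sum of sentiment, position, and special-word attributes (assumed form)."""
    words = tokens(sentence)
    if not words:
        return 0.0
    w_sent, w_pos, w_cue = weights
    # Sentiment attribute: fraction of words found in either polarity lexicon.
    sentiment = sum(w in POSITIVE or w in NEGATIVE for w in words) / len(words)
    # Position attribute: 1.0 at the first or last sentence, 0.0 in the middle.
    position = 1.0 if total == 1 else abs(2 * index / (total - 1) - 1)
    # Special-words attribute: 1.0 if the sentence contains a summary cue word.
    cue = 1.0 if any(w in SUMMARY_CUES for w in words) else 0.0
    return w_sent * sentiment + w_pos * position + w_cue * cue


def split_key_and_detail(text, n_key=1):
    """Return (key_sentences, detailed_sentences), each in document order."""
    sentences = split_sentences(text)
    scores = [sentence_score(s, i, len(sentences)) for i, s in enumerate(sentences)]
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    key_idx = set(ranked[:n_key])
    key = [s for i, s in enumerate(sentences) if i in key_idx]
    detail = [s for i, s in enumerate(sentences) if i not in key_idx]
    return key, detail


review = (
    "I bought this camera last month. "
    "The battery lasts two days and the menu is a bit slow. "
    "Overall, it is a great camera and I recommend it."
)
key_sentences, detailed_sentences = split_key_and_detail(review)
print(key_sentences)
print(detailed_sentences)
```

Under these assumptions, the two returned lists could serve as the two complementary parts of a document: one classifier trained on each part for classifier combination, or the two parts used as separate views in co-training.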

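  • Journal of Computer Research and Development (《计算机研究与发展》)
  • China Science and Technology Core Journal
  • Supervising body: Chinese Academy of Sciences
  • Sponsor: Institute of Computing Technology, Chinese Academy of Sciences
  • Editor-in-chief: Xu Zhiwei (徐志伟)
  • Address: Institute of Computing Technology, Chinese Academy of Sciences, No. 6 Kexueyuan South Road, Beijing
  • Postal code: 100190
  • Email: crad@ict.ac.cn
  • Tel.: 010-62620696 62600350
  • International standard serial number: ISSN 1000-1239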
  • Domestic unified serial number: CN 11-1777/TP
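  • Postal distribution code: 2-654
  • Awards:
  • Top 100 Outstanding Chinese Academic Journals (2001-2007); 2008 China Premium Sci...; "Double-Effect" Journal of the China Periodical Matrix
  • Indexed in domestic and international databases:
  • Abstract Journal (Russia); Abstracts and Citation Database (Netherlands); Engineering Index (USA); Japan Science and Technology Agency Database (Japan); China Science and Technology Core Journals (China); Peking University Core Journals (China; 2000, 2004, 2008, 2011, and 2014 editions)
  • Citations: 40349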