位置:成果数据库 > 期刊 > 期刊详情页
基于马尔可夫逻辑的中文零指代消解
  • ISSN号:1000-1239
  • 期刊名称:《计算机研究与发展》
  • 时间:0
  • 分类:TP301[自动化与计算机技术—计算机系统结构;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]北京大学计算语言学研究所,北京100871, [2]计算语言学教育部重点实验室北京大学,北京100871
  • 相关基金:国家“八六三”高技术研究发展计划基金项目(2015AA015402); 国家自然科学基金项目(61370117,61333018); 国家社会科学基金重大项目(12&ZD227)
中文摘要:

中文零指代消解问题包括零指代项的识别和零指代项的消解2个相互关联的子任务.传统的方法在解决该问题时,往往不考虑2个子任务间的关联关系,比如识别出的零指代项必须被消解以及发生消解的必须是零指代项等约束.基于马尔可夫逻辑网络模型可以将零指代项的识别和零指代项的消解2个子任务融合在统一的机器学习框架下进行联合推断与联合学习,采用局部规则分别针对零指代项的识别和消解进行预测,采用全局规则描述这2个子任务间的关联关系.基于OntoNotes3.0的中文数据集上的实验结果显示,基于马尔可夫逻辑网络的联合学习模型相比于独立学习模型以及多个baseline方法能够获得更好的实验效果.

英文摘要:

Chinese zero anaphora resolution includes two subtasks:zero pronoun detection and zero anaphora resolution,which are correlated with each other.Zero pronoun detection means to recognize all the zero anaphors in a given text,which mainly include null subject or null object,and exist widely in Chinese,Japanese and Italian.Zero anaphora resolution means to determine the antecedent for each recognized zero anaphor,which has already appeared as a noun,pronoun or common noun phrase before the detected zero anaphora in the previous text.Traditional methods to solve Chinese zero anaphora resolution problem generally employ some common-used learning features to construct independent classifiers for zero pronoun detection and zero anaphora resolution,but it cannot capture association relationship between these two subtasks,e.g.recognized zero anaphora must be resolved or the one to be resolved must be zero anaphora and so on.In our method,these two subtasks are combined into a unified machine learning framework with Markov logic to make joint inference and joint learning.We use local formulas to describe zero pronoun detection and zero anaphora resolution respectively,and use global formulas to represent the association relationship between these two subtasks.We find that joint learning model which makes learning with inference can acquire more effective feature weights than independent learning model which just makes learning without inference.Experimental results on OntoNotes3.0Chinese dataset show that our joint learning model can achieve better results compared with independent learning model and other baseline methods.

同期刊论文项目
同项目期刊论文
期刊信息
  • 《计算机研究与发展》
  • 中国科技核心期刊
  • 主管单位:中国科学院
  • 主办单位:中国科学院计算技术研究所
  • 主编:徐志伟
  • 地址:北京市科学院南路6号中科院计算所
  • 邮编:100190
  • 邮箱:crad@ict.ac.cn
  • 电话:010-62620696 62600350
  • 国际标准刊号:ISSN:1000-1239
  • 国内统一刊号:ISSN:11-1777/TP
  • 邮发代号:2-654
  • 获奖情况:
  • 2001-2007百种中国杰出学术期刊,2008中国精品科...,中国期刊方阵“双效”期刊
  • 国内外数据库收录:
  • 俄罗斯文摘杂志,荷兰文摘与引文数据库,美国工程索引,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊(2004版),中国北大核心期刊(2008版),中国北大核心期刊(2011版),中国北大核心期刊(2014版),中国北大核心期刊(2000版)
  • 被引量:40349