Temporal Relation Recognition Based on Temporal Segments and Topic Segments

ISSN: 1671-9352
Journal: Journal of Shandong University (Natural Science) (《山东大学学报：理学版》)
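Classification: TP391 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
Author affiliations: [1] School of Computer and Information Technology, Shanxi University, Taiyuan, Shanxi 030006; [2] Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan, Shanxi 030006
Funding: National High Technology Research and Development Program of China (863 Program) (2015AA015407); National Natural Science Foundation of China (61673248); National Natural Science Foundation of China, Young Scientists Fund (61100138, 61403238, 61502287); Natural Science Foundation of Shanxi Province (2011011016-2, 2012021012-1); Shanxi Province Research Project for Returned Overseas Scholars (2013-022); Shanxi Province University Science and Technology Development Project (20121117); Shanxi Province 2012 Merit-Based Science and Technology Activity Program for Returned Overseas Scholars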

Keywords: temporal relation, temporal segment, topic segment, semantic information processing

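Chinese abstract (English translation):

Temporal relation recognition has become a research focus in natural language processing (NLP) in recent years. This work introduces two semantic units that are coarser-grained than event trigger words, namely temporal segments and topic segments, for temporal relation recognition. First, temporal segments are identified in the text using temporal discourse characteristics. Next, topic segments are identified by combining similarity computation with a support vector machine (SVM) model. Finally, within each topic segment, a maximum entropy classifier identifies temporal relations, taking temporal segments as the objects to be ordered. Experiments on the Chinese corpus of TempEval-2010 yield a macro-averaged precision of 60.09% for temporal relation recognition. The results show that introducing temporal segments effectively reduces unnecessary recognition of temporal-order relations between events, and that the temporal relations obtained under the constraint of topic segments are more concise and more semantically coherent.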
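The similarity step used for topic segmentation can be illustrated with a minimal sketch. It is not the authors' method: the paper combines similarity computation with an SVM model, whereas this sketch uses only a fixed cosine-similarity threshold between adjacent paragraphs. The function names, the whitespace tokenization (Chinese text would need a word segmenter first), and the threshold value of 0.2 are all assumptions made for illustration.

```python
import math
from collections import Counter


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def split_topic_segments(paragraphs, threshold=0.2):
    """Group consecutive paragraph indices into topic segments.

    A new segment starts whenever the similarity between a paragraph and
    the one before it falls below `threshold` (a hypothetical value; the
    paper uses an SVM model at this point instead of a fixed cutoff).
    """
    if not paragraphs:
        return []
    # Assumes whitespace-tokenized input; this is a simplification.
    bags = [Counter(p.lower().split()) for p in paragraphs]
    segments = [[0]]
    for i in range(1, len(bags)):
        if cosine_similarity(bags[i - 1], bags[i]) < threshold:
            segments.append([i])
        else:
            segments[-1].append(i)
    return segments


if __name__ == "__main__":
    docs = [
        "the flood hit the town",
        "the flood damaged the town bridge",
        "stock prices rose sharply today",
    ]
    print(split_topic_segments(docs))
```

In this toy input the first two paragraphs share most of their vocabulary and end up in one segment, while the third shares none and starts a new one. Temporal relations would then be classified only between temporal segments that fall inside the same topic segment.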

English abstract:

Temporal relation recognition is a research focus in natural language processing (NLP). This paper identifies temporal relations based on temporal segments and topic segments, two semantic units of coarser granularity. First, temporal segments are recognized according to temporal discourse characteristics. Then, topic segments are recognized by combining paragraph similarity computation with an SVM model. Finally, within each topic segment, temporal relations between adjacent temporal segments are identified by a maximum entropy classifier. In experiments on the Chinese corpus of TempEval-2010, the macro-average precision of temporal relation recognition was 60.09%. The experimental results show that introducing temporal segments reduces redundant recognition of temporal relations between events, and that, under the scope constraint of topic segments, the resulting temporal relations are more concise and understandable.
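For readers unfamiliar with the reported metric, the following sketch shows one common way to compute macro-averaged precision: precision is computed per relation label and then averaged with equal weight per label. The abstract does not say which labels were averaged over or how labels that were never predicted were handled, so both choices here (averaging over every label seen in gold or predicted data, and scoring a never-predicted label as 0) are assumptions, as are the function name and the example labels.

```python
from collections import defaultdict


def macro_precision(gold, predicted):
    """Unweighted mean of per-label precision.

    Assumption: the average runs over every label that appears in either
    list, and a label that was never predicted contributes precision 0.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    correct = defaultdict(int)    # label -> number of correct predictions
    predicted_count = defaultdict(int)  # label -> number of times predicted
    for g, p in zip(gold, predicted):
        predicted_count[p] += 1
        if g == p:
            correct[p] += 1
    labels = set(gold) | set(predicted)
    if not labels:
        return 0.0
    per_label = [
        correct[label] / predicted_count[label] if predicted_count[label] else 0.0
        for label in labels
    ]
    return sum(per_label) / len(per_label)


if __name__ == "__main__":
    # Illustrative labels only; not data from the paper.
    gold = ["before", "after", "overlap", "before"]
    pred = ["before", "after", "before", "before"]
    print(round(macro_precision(gold, pred), 4))
```

Because every label counts equally, a rare relation type that the classifier handles badly lowers the macro average as much as a frequent one would, which is why the figure can differ noticeably from plain accuracy.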
