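  • Journal: Journal of Chinese Information Processing (中文信息学报)
  • Date: 0
  • Pages: 104-109
  • Language: Chinese
  • Classification: TP391 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
  • Author affiliation: [1] MOE-Microsoft Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology, Harbin, Heilongjiang 150001
  • Funding: National Natural Science Foundation of China (60736014); National 863 Program of China (2006AA010108)
  • Related project: Research on Machine Translation Methods Integrating Linguistic Knowledge and Statistical Models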



In Chinese-English statistical machine translation (SMT), Chinese text usually requires Chinese word segmentation (CWS) to identify the words in a sentence. However, CWS was not developed for SMT, so its results are not necessarily optimal for SMT. In recent years, many studies have tried to make CWS more suitable for SMT; we explore the problem from another direction. Our basic idea is to use multiple CWS results as an additional source of linguistic knowledge, and we present a simple and effective approach to using multiple CWS results in SMT. We also report experimental results for a series of combination strategies; the best result shows a gain of 1.89 BLEU points over a state-of-the-art SMT system.
