东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于模型融合的母语与非母语语音识别

ISSN号：1002-7300
期刊名称：电子测量技术
时间：0
页码：81-115
语言：中文
分类：TP391.42[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]清华大学电子工程系清华信息科学与技术国家实验室（筹）,北京100084
相关基金：国家自然科学基金委员会与微软亚洲研究院联合资助项目60776800;国家高技术研究发展计划（863计划）：项目2006AA010101、项目2007AA04Z223、项目2008AA02Z414.
相关项目：基于内容的跨语言语音检索方法研究

作者：曾定|刘加|

关键词：语音识别, 非母语, 模型融合, 多发音字典, speech recognition, non-native, model merging, pronunciation dictionary

中文摘要：

母语与非母语英语发音方式通常存在固有的差别，这导致基于母语发音训练的语音识别模型不能适应非母语说话人。一种有效的方法是建立模型的补偿机制，来容忍母语与非母语说话人之间的发音变化。分析了中国人受母语的影响带来的英语发音变化，针对音素变化和声音变化，分别采用多发音字典和模型融合技术，实现了中国人说英语的语音识别率提高了15％，但母语英语的语音识别率下降不到1％。

英文摘要：

The inherent differences between native and non-native language pronunciation can lead to non-native language rate of decline using the model trained with native language speech. The confusions between Native and non-Native speaker lead the rate of decline. , which need to create a new model to tolerance this change. Set up on baseline Native English recogntion system, the character of Chinese people speaking English is firstly analyzed in this paper. We propose to analyze and model the phonetic and acoustic confusuons separately, using pronunciation dictionary and acoustic model merging technology to create a new model, with a significant 15% absolute WER reduction on the Chinese English, which only sacrifics 1% recognition rate on the native English.

同期刊论文项目