东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

新疆非母语汉语语音识别中的字典自适应技术

ISSN号：1002-8331
期刊名称：计算机工程与应用
时间：0
页码：141-144
分类：TN912.34[电子电信—通信与信息系统;电子电信—信息与通信工程]
作者机构：[1]新疆大学信息科学与工程学院多语种信息实验室,乌鲁木齐830046
相关基金：国家自然科学基金No.60965002; 新疆高校科研计划培育基金（No.XJEDU2008S15）; 新疆大学博士科研启动基金（No.BS090143）~~
相关项目：面向新疆少数民族汉语语言学习的自动发音错误检测方法的研究

作者：李兵虎|黄浩|

关键词：发音字典, 音素混淆矩阵, 剪枝策略, 新疆维吾尔族说话人, 非母语汉语语音识别, pronunciation dictionary, phoneme confusion matrix, pruning strategy, Uighur speakers in Xinjiang, non-native Mandarin speech recognition

中文摘要：

将标准普通话语音数据训练得到的声学模型应用于新疆维吾尔族说话人非母语汉语语音识别时,由于说话人的普通话发音存在较大偏误,将导致识别率急剧下降。针对这一问题,将多发音字典技术应用于新疆维吾尔族说话人汉语语音识别中,通过统计分析识别器的识别错误,建立音素混淆矩阵,获取音素的发音候选项。利用剪枝策略对发音候选项进行剪枝整合,扩展出符合维吾尔族说话人汉语发音规律的替代字典。对三种剪枝方法产生的发音字典的识别结果进行了对比。实验结果表明,使用相对最大剪枝策略产生的发音字典可以显著提高系统识别率。

英文摘要：

When acoustic models trained on standard Mandarin speech database are applied to Putonghua speech uttered by Uighur speakers in Xinjiang,because of the significant pronunciation deviation of the speakers,recognition accuracy would drop dramatically.To solve this problem,the multi-pronunciation dictionary technique is adopted to improve the performance of non-native speech recognition.Statistical analysis of recognition errors is carried out to build phoneme confusion matrices from which pronunciation candidates can be made.Three pruning schemes are evaluated to best remove the useless pronunciation alternatives.The resulting pronunciation candidates are used to expand pronunciation dictionary for non-native speech recognition.Experimental results on continuous speech recognition show significant improvement can be obtained using resulting pronunciation dictionary.

同期刊论文项目