母语与非母语英语发音方式通常存在固有的差别,这导致基于母语发音训练的语音识别模型不能适应非母语说话人。一种有效的方法是建立模型的补偿机制,来容忍母语与非母语说话人之间的发音变化。分析了中国人受母语的影响带来的英语发音变化,针对音素变化和声音变化,分别采用多发音字典和模型融合技术,实现了中国人说英语的语音识别率提高了15%,但母语英语的语音识别率下降不到1%。
The inherent differences between native and non-native language pronunciation can lead to non-native language rate of decline using the model trained with native language speech. The confusions between Native and non-Native speaker lead the rate of decline. , which need to create a new model to tolerance this change. Set up on baseline Native English recogntion system, the character of Chinese people speaking English is firstly analyzed in this paper. We propose to analyze and model the phonetic and acoustic confusuons separately, using pronunciation dictionary and acoustic model merging technology to create a new model, with a significant 15% absolute WER reduction on the Chinese English, which only sacrifics 1% recognition rate on the native English.