非母语语音识别的性能较低,对于刚开始学习目标语言的说话人或者口音很重的说话人而言,性能下降更为明显。本文提出一种新型的双语模型修正算法用于提高非母语语音的识别性能。在该算法中,基线声学模型的每个状态都将被代表说话人母语特点的辅助模型状态所修正。文章给出了状态修正准则以及不同候选修正状态数下的性能比较。相比已用非母语训练数据自适应以后的基线声学模型,通过双语模型修正的声学模型在保证识别实时率的前提下,短语错误率相对下降了11.7%。
The performance of automatic speech recognition decreases drastically for nonnative speakers, especially those who are just beginning to learn foreign language or who have heavy accents. A novel bilingual model modification approach is presented to improve nonnative speech recognition accuracy. Each state of baseline nonnative acoustic model is modified with several candidate states from the auxiliary acoustic model, which is trained by speakers' mother tongue. State mapping criterion and n-best candidates are investigated. Using this bilingual model modification approach, compared to the nonnative acoustic model which has already been well trained by adaptation technique MAP, the phrase error rate further is reduced by 11.7% relatively, while only a small relative increase on real time factor occurs.