为了解决可视语音合成中语音与口型多对多的对应关系,本文提出基于两层隐马尔可夫模型的可视语音合成,该模型有效结合了语音和口型的上下文相关性,解决了语音与口型多对多的对应问题,合成出了准确率高、连贯、自然的口型序列,该方法具有完全自动化的特点。
In order to solve the problem of many to many mapping between speech and lip in visual speech synthesis, this paper proposed a method based on two-level HMM for visual speech synthesis, which combined the context of audio and mouth effectively, solved many to many correspondence between audio and mouth, synthesized accurate, coherent and natural mouth sequence. And this way is also roboticized completely.