基于语料库的语音合成是国内外应用广泛的语音合成方法.在这种合成方法中,单元选择是语音合成的关键.通过分析藏语言文字的属性特征,设计了藏语语音合成系统模型,提出以构件、组合构件、字、词及句单元相融合的藏语语音合成方法,有效地保留了语音合成中大单元的完整性和小单元的灵活性与鲁棒性.同时,给出语音合成的单元选择策略与算法.实验数据表明:该策略与算法是有效和合理的,所选择的单元在封闭语料和开放语料上的覆盖率均达到预期目标.
Corpus-based speech synthesis is the most widely-used speech synthesis technology in home and abroad. In this type of synthesis method, unit selection is crucial to the speech synthesis. This paper designs a system model of Tibetan speech synthesis by analyzing the attributive characteristics of Tibetan text, and presents a mixed units mode with Tibetan components, combinational components, characters, words and sentences. The method effectively preserves the integrity of larger units and the flexibility and robustness of small units. At the same time, it provides the unit selection strategies and algorithms of Tibetan speech synthesis. Experimental data indicates that the strategies and algorithms are effective and the. coverage of units reaches expected target in both the open corpus and closed corpus.