Research on a Tibetan Phoneme Segmentation Algorithm for Speech Synthesis

ISSN: 1003-0077
Journal: 《中文信息学报》 (Journal of Chinese Information Processing)
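Classification: TP391 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
Author affiliation: [1] Key Laboratory of China's National Linguistic Information Technology, Northwest University for Nationalities, Lanzhou, Gansu 730030
Funding: National Natural Science Foundation of China (61262054); Fundamental Research Funds for the Central Universities, Northwest University for Nationalities (ycx12024)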

Keywords: automatic phoneme segmentation, Tibetan, speech synthesis, corpus

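Chinese abstract (translated):

This article segments the speech in a Tibetan speech synthesis corpus into phonemes using two methods: an automatic method based on monophone HMM models, and the traditional manual method. Experiments compare the accuracy of automatic segmentation against manual segmentation. The results show that, when building a corpus, the automatic method helps shorten the construction cycle, with a clear advantage for large corpora. It saves much of the time and labor that segmentation and annotation require, and it improves the precision and consistency of the annotation in the speech corpus.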

English abstract:

This paper applies two methods to phoneme segmentation of a Tibetan speech synthesis corpus: automatic segmentation based on a monophone HMM model, and traditional manual segmentation. The accuracy of automatic segmentation relative to manual segmentation is analyzed experimentally. The results show that automatic segmentation helps shorten the corpus-building cycle, especially for large corpora. It substantially reduces the time spent on segmentation and labeling, and it improves the accuracy and consistency of the corpus labeling information.
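
The abstract does not say how the accuracy of automatic segmentation was measured against manual segmentation. One common way to make such a comparison is to count how many manually placed phoneme boundaries have an automatic boundary within a fixed time tolerance. The sketch below illustrates that idea only; the function name `boundary_agreement`, the 20 ms default tolerance, and the sample boundary times are assumptions for illustration and are not taken from the paper.

```python
from bisect import bisect_left


def boundary_agreement(auto, manual, tolerance=0.020):
    """Return the fraction of manual boundaries (in seconds) that have
    an automatic boundary within `tolerance` seconds.

    Hypothetical metric for illustration; the 20 ms default is an assumption.
    An automatic boundary may match more than one manual boundary.
    """
    if not manual:
        raise ValueError("manual boundary list is empty")
    auto_sorted = sorted(auto)
    hits = 0
    for t in manual:
        # The nearest automatic boundary sits at index i - 1 or i.
        i = bisect_left(auto_sorted, t)
        neighbours = auto_sorted[max(i - 1, 0):i + 1]
        if neighbours and min(abs(t - a) for a in neighbours) <= tolerance:
            hits += 1
    return hits / len(manual)


# Made-up boundary times (seconds) for one utterance.
manual_boundaries = [0.10, 0.25, 0.40, 0.62]
auto_boundaries = [0.11, 0.24, 0.45, 0.63]
print(boundary_agreement(auto_boundaries, manual_boundaries))
```

With these made-up times, three of the four manual boundaries have an automatic boundary within 20 ms, while the third is off by about 50 ms. Widening the tolerance raises the score, so any reported agreement figure only makes sense alongside the tolerance used.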
