Uyghur Continuous Phoneme Recognition Based on a Telephone Speech Corpus

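ISSN: 1002-0802
Journal: 通信技术 (Communications Technology)
Date: 2012.7
Pages: 54-56
Classification: TN912.34 [Electronics and Telecommunications: Communication and Information Systems; Electronics and Telecommunications: Information and Communication Engineering]
Author affiliation: [1] School of Information Science and Engineering, Xinjiang University, Urumqi, Xinjiang 830046
Funding: Program for New Century Excellent Talents in University (No. NCET-10-0969); National Natural Science Foundation of China (Grant No. 61163032).
Related project: Research on Uyghur morpheme structure rules and their applications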

Keywords: Uyghur language, acoustic model, Uyghur phoneme, telephone speech corpus, HTK tool

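Abstract (translated from the Chinese):

Combining the phonetic features and semantic information of Uyghur, and building on a large telephone speech corpus, this work aims to establish a continuous phoneme recognition platform for Uyghur and implements a Uyghur continuous phoneme recognition algorithm using the Hidden Markov Model Toolkit (HTK). First, a relatively large-scale telephone speech corpus was recorded and annotated according to specific technical indicators. With the phoneme chosen as the basic unit, an HMM (Hidden Markov Model) acoustic model was obtained for each phoneme through training. Input speech was then recognized, and recognition results were obtained for acoustic models with different numbers of Gaussian mixture components. Finally, the recognition rates of 32 phonemes were tabulated and analyzed, laying a foundation for further improving the recognition rate.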

Abstract (English):

Drawing on the phonetic characteristics and semantic information of the Uyghur language and on a large telephone speech corpus, a continuous phoneme recognition platform for Uyghur is established, and a Uyghur continuous phoneme recognition algorithm is implemented with the HTK tool. First, a large-scale telephone speech corpus is recorded and labeled according to specific technical indicators. Then, with the phoneme as the basic unit, an HMM for each phoneme is obtained through training, the input speech is recognized, and recognition rates are obtained for the acoustic model under different numbers of Gaussian mixtures. Finally, the recognition rates of 32 phonemes are tabulated and analyzed, laying a foundation for further improvement of the recognition rate.
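
The abstract reports recognition rates for individual phonemes. As a rough illustration of how such a per-phoneme rate could be computed, the sketch below aligns each reference phoneme sequence with the recognizer output by minimum edit distance and counts, for every reference phoneme, how often it was matched exactly. This is not the authors' code, and it does not call HTK; the function names (`align`, `per_phoneme_correct`) and the phoneme symbols in the usage example are hypothetical. The rate computed here is a percent-correct figure (hits divided by reference occurrences), so insertions are ignored.

```python
from collections import Counter


def align(ref, hyp):
    """Align two symbol sequences by minimum edit distance.

    Returns a list of (ref_symbol, hyp_symbol) pairs, where None on the
    hypothesis side marks a deletion and None on the reference side marks
    an insertion.
    """
    n, m = len(ref), len(hyp)
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(diag, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0
                and cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])):
            pairs.append((ref[i - 1], hyp[j - 1]))  # match or substitution
            i -= 1
            j -= 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            pairs.append((ref[i - 1], None))  # deletion
            i -= 1
        else:
            pairs.append((None, hyp[j - 1]))  # insertion
            j -= 1
    pairs.reverse()
    return pairs


def per_phoneme_correct(utterances):
    """Percent correct for each reference phoneme over (ref, hyp) pairs."""
    total = Counter()
    hits = Counter()
    for ref, hyp in utterances:
        for r, h in align(ref, hyp):
            if r is None:
                continue  # insertions have no reference phoneme to score
            total[r] += 1
            if r == h:
                hits[r] += 1
    return {p: 100.0 * hits[p] / total[p] for p in total}


# Hypothetical usage with made-up phoneme labels.
utterances = [
    (["a", "b", "c"], ["a", "x", "c"]),  # "b" substituted by "x"
    (["a", "b"], ["a", "b", "b"]),       # one extra "b" inserted
]
rates = per_phoneme_correct(utterances)
```

Running the same scoring on recognizer output produced with different numbers of Gaussian mixture components would give one set of per-phoneme rates per configuration, which is the kind of comparison the abstract describes.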