A Chinese Digit Recognition Method Based on GMM Clustering of MFCC Features

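ISSN: 1003-3254
Journal: Computer Systems & Applications (计算机系统应用)
Year: 2011
Pages: 167-170
Classification: TP391.4 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
Author affiliation: [1] School of Internet of Things Engineering, Jiangnan University, Wuxi 214122
Funding: National Natural Science Foundation of China (61075008)
Related project: Research on extracting new time-frequency perceptual features of Mandarin speech signals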

Keywords: Chinese digit recognition, MFCC, GMM clustering

Chinese abstract (translated):

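MFCCs are commonly used as features in Chinese digit recognition. To address the large volume of MFCC feature data for the ten digits 0-9, the paper proposes clustering the extracted MFCC feature data with a GMM to reduce the data volume, taking the means among the GMM parameters as the new features, and using a dynamic programming algorithm for Chinese digit speech recognition. Simulation experiments show that, after the GMM feature transformation, the new feature data amount to 30.9% of the MFCC data, the system running time decreases by 237.18 s, and the recognition rate drops by 1.11%.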
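
The compression step described in the abstract, replacing many MFCC frames with a small set of GMM means, might look roughly like the following. This is a minimal sketch under assumptions the record does not state: diagonal covariances, a fixed number of EM iterations, and a deterministic initialization from evenly spaced frames. The function name `fit_gmm_means` and its parameters are hypothetical, not taken from the paper.

```python
import math

def fit_gmm_means(frames, k, iters=50, var_floor=1e-3):
    """Fit a diagonal-covariance GMM with EM and return the k component means.

    frames: list of equal-length feature vectors (e.g. MFCC frames).
    All settings here are illustrative assumptions, not the paper's.
    """
    n, d = len(frames), len(frames[0])
    # Deterministic init: use k evenly spaced frames as starting means.
    means = [list(frames[(i * n) // k]) for i in range(k)]
    variances = [[1.0] * d for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: responsibilities from log-densities, normalized stably.
        resp = []
        for x in frames:
            logp = []
            for j in range(k):
                lp = math.log(weights[j])
                for t in range(d):
                    v = variances[j][t]
                    lp -= 0.5 * (math.log(2 * math.pi * v)
                                 + (x[t] - means[j][t]) ** 2 / v)
                logp.append(lp)
            top = max(logp)
            p = [math.exp(l - top) for l in logp]
            total = sum(p)
            resp.append([pj / total for pj in p])
        # M-step: re-estimate weights, means, and floored variances.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-12:
                continue  # leave an empty component unchanged
            weights[j] = nj / n
            for t in range(d):
                mu = sum(r[j] * x[t] for r, x in zip(resp, frames)) / nj
                var = sum(r[j] * (x[t] - mu) ** 2
                          for r, x in zip(resp, frames)) / nj
                means[j][t] = mu
                variances[j][t] = max(var, var_floor)
    return means
```

Under these assumptions, an utterance with `n` frames is reduced to `k` mean vectors, so the stored feature data shrink to `k / n` of the original size. The number of components the paper used is not given in this record.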

英文摘要：

MFCC is widely used in Chinese digit recognition. Because the volume of MFCC data extracted for the digits 0-9 is large, the MFCC data are clustered with a GMM, and the means of the model parameters are used as a new feature with DTW for Chinese digit recognition. Simulation results show that the new feature amounts to 30.9% of the MFCC data and the running time falls by 237.18 s, while the recognition rate decreases by 1.11%.
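
The DTW matching step named in the abstract can be sketched as a standard dynamic-programming alignment between two sequences of feature vectors, followed by nearest-template classification. This is an illustrative sketch only: the Euclidean local distance, the unconstrained step pattern, and the names `dtw_distance` and `recognize` are assumptions, and the record does not say how the paper orders the GMM means before alignment.

```python
import math

def dtw_distance(a, b):
    """Classic DTW cost between two sequences of feature vectors.

    Uses Euclidean local distance and the basic three-way step pattern
    (assumed here; the paper's exact variant is not stated).
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # stretch b
                                     cost[i][j - 1],      # stretch a
                                     cost[i - 1][j - 1])  # advance both
    return cost[n][m]

def recognize(query, templates):
    """Return the label of the template with the smallest DTW cost.

    templates: dict mapping a digit label to its feature sequence.
    """
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))
```

Because the DTW cost table has one cell per pair of vectors, shortening each sequence reduces the matching work roughly in proportion to the product of the two lengths, which is consistent with the running-time reduction the abstract reports.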
