东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于矢量泰勒级数的鲁棒语音识别

ISSN号：0493-2137
期刊名称：天津大学学报
时间：0
页码：261-265
分类：TN912.34[电子电信—通信与信息系统;电子电信—信息与通信工程]
作者机构：[1]东南大学信息科学与工程学院,南京210096
相关基金：国家自然科学基金资助项目（60971098）
相关项目：基于小型麦克风阵列的声源定位方法研究

作者：吕勇|吴镇扬|

关键词：特征补偿, 矢量泰勒级数, 噪声估计, 鲁棒语音识别, feature compensation, vector Taylor series, noise estimation, robust speech recognition

中文摘要：

矢量泰勒级数是一种有效的抗噪声鲁棒语音识别算法.然而在对数谱域,美尔滤波器组的不同通道之间有较强的相关性,因而难以从含噪语音中准确估计噪声的方差.提出了一种基于矢量泰勒级数的倒谱域特征补偿算法.该算法在倒谱域,用一个高斯混合模型描述语音倒谱特征的分布,通过矢量泰勒级数从含噪语音中估计噪声的均值和方差.实验结果表明,此算法能明显提高语音识别系统的性能,优于基于矢量泰勒级数的对数谱域特征补偿算法.

英文摘要：

The vector Taylor series（VTS）expansion is an effective approach to noise robust speech recognition.How-ever,in the log-spectral domain,there exist the strong correlations among the different channels of Mel filter bank and thus it is difficult to estimate the noise variance from noisy speech proposes.A feature compensation algorithm in the cepstral domain based on vector Taylor series was proposed.In this algorithm,the distribution of speech cepstral features was represented by a Gaussian mixture model（GMM）,and the mean and variance of noise were estimated from noisy speech by the VTS approximation.The experimental results show that the proposed algorithm can signifi-cantly improve the performance of speech recognition system,and outperforms the VTS-based feature compensation method in the log-spectral domain.

同期刊论文项目