东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

说话人识别的特征组合方法

ISSN号：1001-9081
期刊名称：《计算机应用》
时间：0
分类：TN912[电子电信—通信与信息系统;电子电信—信息与通信工程]
作者机构：湘潭大学物理与光电工程学院,湖南湘潭411105
相关基金：国家自然科学基金资助项目（60972147）

关键词：说话人识别, 梅尔频率倒谱系数, 线性预测系数, 矢量量化, 高斯混合模型, speaker recognition, Mel frequency cepstrum coefficient, linear prediction coefficient, Vector Quantization（VQ）, Gaussian Mixture Model（GMM）

中文摘要：

针对说话人识别中线性预测系数（LPC）和梅尔频率倒谱系数（MFCC）的直接组合会增加特征参数的维数和出现运算量大的问题,提出了将LPC参数融入到MFCC参数的计算中的特征提取方法。首先,计算语音信号的LPC系数,求出LPC功率谱;其次,将LPC功率谱通过三角形滤波器组,并取对数;最后,将取对数后的输出做离散余弦变换,得到新特征线性预测梅尔频率倒谱系数（LPMFCC）。LPMFCC参数兼具LPC参数的声道特性和MFCC参数的听觉特性,虽增加了一步计算,但不增加参数的维数,运算量相对较少。实验结果表明,在纯净语音环境下,提出的LPMFCC参数的说话人识别率较LPC参数和MFCC参数的说话人识别率分别提升了18.57%和10%,在不同噪声环境下,分别提高了13.22%和4.55%。

英文摘要：

Focusing on the issue that direct combination of Linear Prediction Coefficient（ LPC） and Mel Frequency Cepstrum Coefficient（ MFCC） will increase the dimension of the feature parameters and lead to heavy computation,the method by integrating LPC parameters into the computation of MFCC parameters was proposed. Firstly,LPC parameters from speech singal were calculated and the speech power spectrum of LPC were gotten; Secondly,the logarithm of output by making the speech power spectrum of LPC through triangular filter group was conducted. Finally,the output of logarithm was transformed by discrete cosine transform,and a new feature factor which is called Linear Prediction Mel Frequency Cepstrum Coefficient（ LPMFCC） was obtained. LPMFCC parameters had both vocal track of LPC parameters and auditory of MFCC parameters. Although increasing a step of computation,the dimension of parameters was not increased and computation cost was relatively low. The simulation results show that the speaker recognition rate of the proposed LPMFCC parameters promotes by 18. 57% and 10% than the speaker recognition rate of LPC parameters and MFCC parameters respectively in the pure voice database,while by 13. 22% and 4. 55% respectively in various noise environments.

同期刊论文项目

微弱信号检测的耦合混沌电路设计及混合集成

期刊论文 18 会议论文 4 专利 1

同项目期刊论文

一类自治混沌系统的动力学分析与电路实现

可集成互感耦合电路的实现

一种10位电容电阻混合型双极性D/A转换器

Chen系统的非共振参数控制

改进型Hénon映射生成混沌伪随机序列及性能分析

用多项式和阶跃函数构造网格多涡卷混沌吸引子及其电路实现

加权最小二乘法设计分数延迟FIR数字滤波器

混沌背景中微弱信号参数估计新方法

多功能AB类四象限模拟乘法器

混沌噪声背景下周期小信号频率估计新方法

基于MA型分布式算法的高阶FIR滤波器设计及其FPGA实现

线性可调全差分OTA的实现

嵌入式I2C从机设计、验证与版图实现

基于双T网络的忆阻器振荡器

Chen系统在微弱信号检测中的应用

真空气相沉积生长PTCDI-C8 N型有机薄膜晶体管

MLC高阶振荡电路的研究与设计

期刊信息

《计算机应用》
北大核心期刊（2011版）

主管单位:四川省科学技术协会
主办单位:四川省计算机学会中国科学院成都分院
主编：张景中
地址：成都市人民南路四段九号科分院计算所
邮编：610041
邮箱：xzh@joca.cn
电话：028-85224283

国际标准刊号：ISSN：1001-9081
国内统一刊号：ISSN：51-1307/TP
邮发代号:62-110

获奖情况:
全国优秀科技期刊一等奖,国家期刊奖提名奖,中国期刊方阵双奖期刊,中文核心期刊,中国科技核心期刊

国内外数据库收录:
俄罗斯文摘杂志,波兰哥白尼索引,美国剑桥科学文摘,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:53679