Dongli Research Big Data Discovery System (DRDS)


Speaker Recognition Based on Bayesian Network Fusion of Multiple Voice Features

ISSN: 0254-3087
Journal: Chinese Journal of Scientific Instrument (《仪器仪表学报》)
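Classification: TP391.42 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
Author affiliations: [1] School of Mechanical Engineering, University of Shanghai for Science and Technology, Shanghai 200093; [2] School of Mechatronic Engineering and Automation, Shanghai University, Shanghai 200072
Funding: Supported by the National Natural Science Foundation of China (50975179), the Scientific Research Innovation Project of the Shanghai Municipal Education Commission (11ZZ136), and the Scientific Research Program of the Shanghai Municipal Science and Technology Commission (12DZ2252300)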

Keywords: Mel-frequency cepstrum coefficients (MFCC) feature, one-third octave feature, Bayesian network, posterior probability, speaker recognition, fusion

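Chinese abstract:

Speaker recognition that relies on a single extracted speech feature suffers from low accuracy. To address this, the paper proposes using two features of the speaker's voice as feature parameters: the MFCC feature, which reflects the perceptual characteristics of human hearing, and the 1/3 octave feature, which approximates the psychoacoustic critical bands. A Bayesian network for speaker recognition is designed to fuse the two kinds of feature parameters, and recognition is carried out by Bayesian network inference. Through a learning process, the network determines the conditional probabilities of each voice feature for every registered speaker. At recognition time, the network applies Bayes' theorem and the conditional independence assumption to fuse the MFCC and 1/3 octave features of the unknown speaker's voice, computes the posterior probability of each registered speaker given the input feature vectors, and infers the speaker from the magnitudes of these posterior probabilities. The experimental results show that the proposed method is feasible and effective, with a recognition accuracy of 100%.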

English abstract:

Aiming at the low recognition accuracy of speaker recognition based on a single voice feature extraction method, the Mel-frequency cepstrum coefficients (MFCC) feature, which reflects human auditory perception characteristics, and the one-third octave feature, which is close to the psychoacoustic critical bands, are extracted from the speakers' voice as feature parameters. A Bayesian network for speaker recognition is designed to fuse the two kinds of voice feature parameters, and speaker recognition is finally achieved with Bayesian network inference. The Bayesian network determines the conditional probability of each voice feature of the registered speakers through a learning process. When speaker recognition is carried out, the Bayesian network fuses the MFCC feature and the 1/3 octave feature of the voice of the speaker to be identified by using Bayes' theorem and the conditional independence assumption, calculates the posterior probability of each registered speaker with respect to the input voice feature vectors, and infers the speaker to be identified according to the values of the posterior probability. The speaker recognition experiment results show that the proposed speaker recognition method based on Bayesian network fusion of multiple voice features is feasible and effective, and the correct recognition rate can reach 100%.
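The fusion step described in the abstract (Bayes' theorem plus conditional independence of the two features given the speaker) can be sketched roughly as follows. This is only an illustrative sketch, not the paper's implementation: the abstract does not say how the conditional probabilities are modeled, so the diagonal Gaussian density, the equal priors, the feature dimensions (12 and 20), and all names (`GaussianFeatureModel`, `enroll`, `posterior`, `identify`) are assumptions. Feature extraction is not shown; synthetic vectors stand in for MFCC and 1/3 octave features.

```python
import numpy as np


class GaussianFeatureModel:
    """Diagonal Gaussian for one feature type of one speaker (assumed density form)."""

    def __init__(self, frames):
        frames = np.asarray(frames, dtype=float)
        self.mean = frames.mean(axis=0)
        self.var = frames.var(axis=0) + 1e-6  # small floor avoids division by zero

    def log_likelihood(self, x):
        x = np.asarray(x, dtype=float)
        return float(-0.5 * np.sum(np.log(2 * np.pi * self.var)
                                   + (x - self.mean) ** 2 / self.var))


def enroll(training_data):
    """Learning step: training_data maps speaker -> (mfcc_frames, octave_frames)."""
    return {spk: (GaussianFeatureModel(mfcc), GaussianFeatureModel(octv))
            for spk, (mfcc, octv) in training_data.items()}


def posterior(models, mfcc_vec, octave_vec, priors=None):
    """P(speaker | mfcc, octave), assuming the two features are
    conditionally independent given the speaker."""
    speakers = list(models)
    if priors is None:  # equal priors are an assumption
        priors = {s: 1.0 / len(speakers) for s in speakers}
    log_joint = np.array([
        np.log(priors[s])
        + models[s][0].log_likelihood(mfcc_vec)
        + models[s][1].log_likelihood(octave_vec)
        for s in speakers
    ])
    log_joint -= log_joint.max()  # shift for numerical stability before exponentiating
    probs = np.exp(log_joint)
    probs /= probs.sum()          # normalising plays the role of the evidence term
    return dict(zip(speakers, probs))


def identify(models, mfcc_vec, octave_vec):
    """Return the registered speaker with the largest posterior probability."""
    post = posterior(models, mfcc_vec, octave_vec)
    return max(post, key=post.get)


# Synthetic stand-in data: 200 frames per speaker, 12-dim "MFCC", 20-dim "1/3 octave".
rng = np.random.default_rng(0)
training = {
    "speaker_a": (rng.normal(0.0, 1.0, (200, 12)), rng.normal(0.0, 1.0, (200, 20))),
    "speaker_b": (rng.normal(3.0, 1.0, (200, 12)), rng.normal(3.0, 1.0, (200, 20))),
}
models = enroll(training)
print(identify(models, np.full(12, 3.0), np.full(20, 3.0)))
```

Working in the log domain keeps the product of the two likelihoods from underflowing. A per-speaker prior other than the uniform one could be passed through the `priors` argument.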

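Project associated with this paper

Real-Time Measurement and Control Mechanism and Experimental Study of the Stress Environment at the Trauma Section during Fracture Healing

Journal papers: 26; conference papers: 8; awards: 2; patents: 7; books: 2

Journal papers from the same project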

Static Calibration and Decoupling of Multi-dimensional Force Sensor Based on GM(0,N) Model

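Automatic Recognition Method for Spherical Fruit Based on Grey Relational Analysis and Fuzzy Membership Matching

Fracture Healing Stress Prediction Based on a Weakening Buffer Operator and the GM(1,1) Equal-Dimension New-Information Model

Fracture Healing Stress Prediction Based on a Grey Neural Network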

Sensor Calibration Model Based on the Grey Linear Regression Combined Model

Pneumatic Position Servo Control System Based on Grey Relational Compensation Control

Grey Predictive Control of Stress on Trauma Section during Union of Fracture

A Quantitative Evaluation Method for Fracture Union Quality Based on Grey Relational Analysis

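Simulation Study of Grey Predictive PID Control Based on a Strengthening Buffer Operator

Optimization Method for Physiological Stimulation Stress at the Fracture Trauma Section Based on an Improved Immune Genetic Algorithm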

Text-independent Speaker Recognition Based on One Third Octave Feature and Grey Relational Analysis

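Position/Stiffness Control of an Antagonistic Bionic Joint Driven by Pneumatic Muscles

A New Method for Predicting the Tool-Tip Frequency Response Function of a Milling Cutter

Optimized Identification of Dynamic Characteristic Parameters of Fixed Joint Surfaces Based on an Improved Adaptive Genetic Algorithm

RCSA-Based Prediction of the Spindle-End Frequency Response Function of a Deep-Hole Internal Grinder

An Improved Tool-Tip Frequency Response Function Prediction Method Based on Receptance Coupling Substructure Analysis

Design and Motion Characteristics Analysis of a Wheel-Legged Stair-Climbing Mobile Robot

Discontinuous Foot-End Constraints and Dynamic Modeling for Dynamically Stable Trot-Gait Walking of a Quadruped Robot

Finite Element Simulation of the Tempering Process of 65Mn Steel Circular Saw Blades and Improvement of the Heating Method

Improving the Motion Accuracy of PZT Based on Grey Prediction-Fuzzy Control

Dynamic Characteristics Analysis of a Ball Screw Feed System Based on Joint Stiffness Characteristics

Tool Wear State Recognition Based on Feature Fusion of Heterogeneous Signals

Identification of Axially Distributed Parameters of Spindle-Holder-Tool Joint Surfaces Based on Frequency Response Function Analysis

Position Control of a Magnetically Levitated Ball Based on Neural Network Feedback Compensation Control

Corner Detection Algorithm Based on the Grey Absolute Relational Degree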

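Journal information

Chinese Journal of Scientific Instrument (《仪器仪表学报》)
Chinese Science and Technology Core Journal

Supervising body: China Association for Science and Technology
Sponsor: China Instrument and Control Society
Editor-in-chief: Zhang Zhonghua
Address: 79 Beiheyan Street, Dongcheng District, Beijing
Postal code: 100009
Email: yqyb@vip.163.com
Telephone: 010-84050563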

International Standard Serial Number: ISSN 0254-3087
Domestic unified serial number: CN 11-2179/TH
Postal distribution code: 2-369

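Awards:
Third Prize for Scientific and Technological Progress, Ministry of Machinery (1983); Third Prize for Outstanding Science and Technology Journals, China Association for Science and Technology (1997)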

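Indexed in domestic and international databases:
Chemical Abstracts, online edition (USA); abstract and citation database (Netherlands); Engineering Index (USA); Japan Science and Technology Agency database (Japan); Chinese Science and Technology Core Journals (China); Peking University Core Journals, 2000, 2004, 2008, 2011, and 2014 editions (China); Royal Society of Chemistry abstracts (UK)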

Citations: 42481