位置:成果数据库 > 期刊 > 期刊详情页
汉语数字助听器高频听损增强方法的实验研究
  • ISSN号:0371-0025
  • 期刊名称:声学学报
  • 时间:0
  • 页码:527-533
  • 分类:TN912.35[电子电信—通信与信息系统;电子电信—信息与通信工程]
  • 作者机构:[1]东南大学水声信号处理教育部重点实验室,南京210096, [2]安徽大学智能计算与信号处理教育部重点实验室,合肥230601
  • 相关基金:The National Natural Science Foundation of China (No.61231002,61273266,51075068,60872073,60975017, 61003131); the Ph.D.Programs Foundation of the Ministry of Education of China(No.20110092130004); the Science Foundation for Young Talents in the Educational Committee of Anhui Province(No. 2010SQRL018); the 211 Project of Anhui University(No.2009QN027B)
  • 相关项目:耳语音情感特征分析与识别方法研究
中文摘要:

提出了一种基于机器学习的耳语音可懂度增强方法.该方法利用已经训练好的2类支持向量机来估计一个二元时频掩蔽值,进而合成增强后的耳语音.输入支持向量机的特征向量GFCCs是基于听觉外周模型进行提取的,具有噪声鲁棒特性.在增强仿真实验中,将该算法同传统语音增强算法进行语音可懂度增强性能比较.客观评价和主观听力实验结果均表明,所提出的方法能有效提高含噪耳语音的听觉可懂度;相比谱减法和log-MMSE方法在低信噪比时无法提高语音可懂度,该方法在低信噪比时仍可有效提高含噪耳语音的听觉可懂度.此外,含噪耳语音通过所提出的方法进行增强后,其可懂度比未增强时明显提高.

英文摘要:

A machine learning based speech enhancement method is proposed to improve the intelligibility of whispered speech. A binary mask estimated by a two-class support vector machine (SVM) classifier is used to synthesize the enhanced whisper. A novel noise robust feature called Gammatone feature cosine coefficients (GFCCs) extracted by an auditory periphery model is derived and used for the binary mask estimation. The intelligibility performance of the proposed method is evaluated and compared with the traditional speech enhancement methods. Objective and subjective evaluation results indicate that the proposed method can effectively improve the intelligibility of whispered speech which is contaminated by noise. Compared with the power subtract algorithm and the log-MMSE algorithm, both of which do not improve the intelligibility in lower signal-to-noise ratio (SNR) environments, the proposed method has good performance in improving the intelligibility of noisy whisper. Additionally, the intelligibility of the enhanced whispered speech using the proposed method also outperforms that of the corresponding unprocessed noisy whispered speech.

同期刊论文项目
期刊论文 114 会议论文 7 专利 35
同项目期刊论文
期刊信息
  • 《声学学报》
  • 中国科技核心期刊
  • 主管单位:中国科学院
  • 主办单位:中国科学院声学研究所
  • 主编:王小民
  • 地址:北京北四环西路21号
  • 邮编:100190
  • 邮箱:
  • 电话:010-62558329
  • 国际标准刊号:ISSN:0371-0025
  • 国内统一刊号:ISSN:11-2065/O4
  • 邮发代号:2-181
  • 获奖情况:
  • 中国期刊方阵“双效”期刊
  • 国内外数据库收录:
  • 荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,日本日本科学技术振兴机构数据库,美国应用力学评论,中国中国科技核心期刊,中国北大核心期刊(2004版),中国北大核心期刊(2008版),中国北大核心期刊(2011版),中国北大核心期刊(2014版),中国北大核心期刊(2000版)
  • 被引量:8376