位置:成果数据库 > 期刊 > 期刊详情页
Intelligibility enhancement for noisy whispered speech using asymmetric cost function
  • ISSN号:0217-9776
  • 期刊名称:《声学学报:英文版》
  • 时间:0
  • 分类:TN912.33[电子电信—通信与信息系统;电子电信—信息与通信工程] F224.0[经济管理—国民经济]
  • 作者机构:[1]Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University Hefei 230601, [2]Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University Nanjing 210096, [3]Key Laboratory of Child Development and Learning Science of Ministry of Education, Southeast University Nanjing Jiangsu 210096
  • 相关基金:supported by the National Natural Science Foundation of China(61301295,61273266,61231002); the Natural Science Foundation of Anhui Province(1308085QF100,1408085MF113); the Doctoral Fund of Anhui University
中文摘要:

We proposed two whispered speech enhancement methods based on asymmetric cost functions in this paper to deal with the amplification and attenuation distortions of whispered speech distinctively.The modified Itakura-Saito(MIS)distance function provides more penalties to speech amplification distortion,whereas the Kullback-Leibler(KL)divergence function gives more penalties to speech attenuation distortion.The experimental results show that the MIS function based method achieves significant improvement of intelligibility in contrast to the conventional speech enhancement algorithms when the signal-to-noise ratio(SNR)falls below-6 dB,whereas the KL function based one achieves the similar result as the minimum mean square error(MMSE)speech enhancement method.The results show that the effects of the amplification and attenuation distortions on the intelligibility of the enhanced whisper are different,where larger attenuation distortion may result in better intelligibility of speech with low SNR.However,the attenuation distortion has small effects on intelligibility of speech with high SNR.

英文摘要:

We proposed two whispered speech enhancement methods based on asymmetric cost functions in this paper to deal with the amplification and attenuation distortions of whispered speech distinctively.The modified Itakura-Saito(MIS)distance function provides more penalties to speech amplification distortion,whereas the Kullback-Leibler(KL)divergence function gives more penalties to speech attenuation distortion.The experimental results show that the MIS function based method achieves significant improvement of intelligibility in contrast to the conventional speech enhancement algorithms when the signal-to-noise ratio(SNR)falls below-6 dB,whereas the KL function based one achieves the similar result as the minimum mean square error(MMSE)speech enhancement method.The results show that the effects of the amplification and attenuation distortions on the intelligibility of the enhanced whisper are different,where larger attenuation distortion may result in better intelligibility of speech with low SNR.However,the attenuation distortion has small effects on intelligibility of speech with high SNR.

同期刊论文项目
同项目期刊论文
期刊信息
  • 《声学学报:英文版》
  • 主管单位:
  • 主办单位:中国科学院声学所 中国声学会
  • 主编:
  • 地址:北京北四环西路21号
  • 邮编:100080
  • 邮箱:jsx@mail.ioa.ac.cn
  • 电话:010-62558329
  • 国际标准刊号:ISSN:0217-9776
  • 国内统一刊号:ISSN:11-2066/O3
  • 邮发代号:
  • 获奖情况:
  • 国内外数据库收录:
  • 美国应用力学评论
  • 被引量:47