东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

汉语耳语音孤立字识别研究

ISSN号：1000-310X
期刊名称：《应用声学》
时间：0
分类：TN912.34[电子电信—通信与信息系统;电子电信—信息与通信工程]
作者机构：[1]南京大学声学所近代声学重点实验室,南京210093
相关基金：国家自然科学基金项目（60272037和60340420325）

关键词：耳语音, 语音识别, 语音增强, Whispered speech, Speech recognition, Speech enhancement

中文摘要：

耳语音识别有着广泛的应用前景,是一个全新的课题.但是由于耳语音本身的特点,如声级低、没有基频等,给耳语音识别研究带来了困难.本文根据耳语音信号发音模型,结合耳语音的声学特性,建立了一个汉语耳语音孤立字识别系统.由于耳语音信噪比低,必须对其进行语音增强处理,同时在识别系统中应用声调信息提高了识别性能.实验结果说明了MFCC结合幅值包络可作为汉语耳语音自动识别的特征参数,在小字库内用HMM模型识别得出的识别率为90.4%.

英文摘要：

The whispered speech recognition is a new subject which has wide applications. However, the characteristics of whispered speech such as its low sound pressure level and the lack of fundamental frequency bring difficulty to the whispered speech recognition. In this paper, a Chinese isolated word recognition system is established based on the source-filter generation model combined with the acoustic characteristics of whispered speech. In addition, the speech enhancement algorithm is added to the system to improve the SNR of whispered speech, and the tone information is implemented to acquire better recognition performance. The experimental results demonstrate that the MFCC combined with the amplitude contour features can be used as efficient parameters for the Chinese whispered speech recognition. A recognition rate of 90.4% is obtained when a small Chinese isolated word database is tested using HMM approach.

同期刊论文项目