东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于听觉感知的语音稀疏表示及压缩感知

ISSN号：1000-310X
期刊名称：应用声学
时间：0
页码：-
分类：TN912.3[电子电信—通信与信息系统;电子电信—信息与通信工程]
作者机构：[1]西安通信学院,西安710106
相关基金：国家自然科学基金项目（61072125）
相关项目：基于压缩感知的语音信号建模与编码技术研究

关键词：语音信号, 稀疏表示, 听觉感知, 压缩感知, Speech signal, Sparse representation, Auditory perception, Compressed sensing

中文摘要：

本文针对语音信号稀疏表示及压缩感知问题，将听觉感知引入稀疏系数筛选过程，用掩蔽阈值筛选重要系数，以得到更符合听觉感受的语音稀疏表示。通过对一帧浊音信号分别采用掩蔽阈值和能量阈值方法进行系数筛选对比实验，结果表明掩蔽阈值法具有更好的稀疏表示效果。为验证听觉感知对语音压缩感知性能的影响，与能量阈值法对照对测试语音进行压缩感知观测和重构，通过压缩比、信噪比、主观平均意见分等主客观指标评价其性能，结果表明，掩蔽阈值法可有效地提高压缩比且保证重构语音具有较高的主观听觉质量。

英文摘要：

This paper concerns the sparse representation and compressed sensing for speech signal, the auditory perception was brought into the selection of sparse coefficients to obtain a sparse representation which is more suitable to hearing. An experiment of sparse coefficients＇ selection under masking thresholds method was done comparing with the energy threshold method, the result showed the masking thresholds method was better. In order to validate the impact of the auditory perception model to compressed sensing for speech signal, the testing speeches were chosen to compress under compressed sensing framework by masking threshold method and energy threshold method, through the subjective and objective indicators, such as compression ratio, signal noise ratio and mean opinion score, a conclusion was made： the masking threshold method can lead a increasing of compression ratio while the quality of hearing for reconstructed signal is not decreased.

同期刊论文项目