A Multiband-Based 2.4 kbit/s Waveform Interpolation Algorithm

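ISSN: 1009-5896
Journal: Journal of Electronics & Information Technology (《电子与信息学报》)
Date: 0
Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems; Electronics and Telecommunications: Information and Communication Engineering]
Author affiliation: [1] School of Information Science and Engineering, Southeast University, Nanjing 210096
Funding: Supported by the National Natural Science Foundation of China (60672094)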

Keywords: Speech coding, Waveform interpolation, Multiband

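Chinese abstract (translated):

Traditional characteristic waveform interpolation speech coding algorithms ignore the phase information of the characteristic waveform and align the characteristic waveform as a whole. This often causes the high-frequency harmonic components of the speech to be lost, which makes the speech sound noisy. To improve the quality of the synthesized speech, this paper introduces multiband voiced/unvoiced flags and uses them as the basis for estimating the phase spectra of the slowly evolving waveform and the rapidly evolving waveform in the waveform interpolation coding model. During speech synthesis, the characteristic waveforms are only partially aligned. On this basis, a multiband-based 2.4 kbit/s characteristic waveform interpolation algorithm is proposed. Compared with the traditional algorithm, the new algorithm noticeably improves speech clarity. Compared with the standard 2.4 kbit/s MELP algorithm, the quality of its synthesized speech is also slightly better.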

English abstract:

In traditional characteristic waveform interpolation speech coding, the high-frequency harmonics of the synthesized speech are usually lost, which makes the speech sound noisy. This loss is caused by neglecting the phase information of the characteristic waveform and by aligning the characteristic waveform as a whole. To improve the quality of the synthesized speech, multiband voiced/unvoiced flags are first introduced. The phase spectra of the slowly evolving and rapidly evolving waveforms in the waveform interpolation coding model are then estimated from these multiband voiced/unvoiced flags. Partial alignment of the characteristic waveform is used in the speech synthesis stage, and a 2.4 kbit/s multiband waveform interpolation speech coding algorithm is proposed. Compared with the traditional characteristic waveform interpolation algorithm, the new algorithm noticeably improves speech clarity. Compared with the standard 2.4 kbit/s MELP algorithm, its synthesized speech quality is also slightly better.
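To make the idea of flag-driven phase estimation more concrete, the following is a minimal Python sketch and not the authors' method. It assumes that harmonics falling in bands flagged as voiced receive a fixed zero phase, and that harmonics in bands flagged as unvoiced receive a random phase. The abstract does not specify the band layout, the phase model, or the partial alignment procedure, so the function names `band_of_harmonic`, `multiband_phase`, and `synthesize_cycle`, the equal-width band split, and the zero/random phase rule are all hypothetical choices made for illustration.

```python
import numpy as np


def band_of_harmonic(k, num_harmonics, num_bands):
    """Map a 0-based harmonic index to one of num_bands equal-width bands.

    Equal-width bands are an assumption; the abstract gives no band layout.
    """
    return min(k * num_bands // num_harmonics, num_bands - 1)


def multiband_phase(num_harmonics, band_voiced, rng):
    """Return one phase value per harmonic, driven by per-band flags.

    Assumed rule (illustrative only): zero phase in voiced bands,
    uniformly random phase in unvoiced bands.
    """
    num_bands = len(band_voiced)
    phase = np.zeros(num_harmonics)
    for k in range(num_harmonics):
        if not band_voiced[band_of_harmonic(k, num_harmonics, num_bands)]:
            phase[k] = rng.uniform(-np.pi, np.pi)
    return phase


def synthesize_cycle(magnitudes, phase, cycle_length):
    """Build one pitch cycle as a sum of cosines at harmonics 1..K."""
    n = np.arange(cycle_length)
    k = np.arange(1, len(magnitudes) + 1)
    angles = 2.0 * np.pi * np.outer(k, n) / cycle_length + phase[:, None]
    return (magnitudes[:, None] * np.cos(angles)).sum(axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    flags = [True, True, False, False]  # hypothetical: low bands voiced
    phase = multiband_phase(8, flags, rng)
    cycle = synthesize_cycle(np.ones(8), phase, 40)
    print(phase.round(3))
    print(cycle.shape)
```

In this sketch the voiced bands keep a coherent, pulse-like structure while the unvoiced bands contribute noise-like energy, which is one simple way to retain high-frequency content without transmitting phase explicitly.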
