为了提高音频水印的检测性能,基于音频帧MFCC特征的统计特性,提出了一种音频水印盲检测方法。在音频帧的DCT系数上嵌入扩频水印,对嵌入水印的音频帧和原始音频帧分别提取MFCC特征进行训练,分别建立高斯混合模型,并通过变分贝叶斯学习方法估计出高斯混合模型的参数,检测时依据最大似然的原则。实验结果显示提出的方法在音频信号受到噪声干扰和恶意攻击的情况下,相对基于EM算法的方法在误检率上有明显降低,在小样本训练情况下具有更好的效果并且可以有效避免过拟合的问题。
In order to improve the performance of audio watermarking detection, a blind audio watermarking mechanism using the statistical characteristics based on MFCC features of audio frames was proposed. The spread spectrum watermarking was embedded in the DCT coefficients of audio frames. MFCC features extracted from watermarked audio frames as well as un-watermarked ones were trained to establish their Gaussian mixture models and to estimate the parameters by vatiational Bayesian learning method respectively. The watermarking was detected according to the maximum likelihood principle. The experimental results show that our method can lower the false detection rate compared with the method using EM algorithm when the audio signal was under noise and malicious attacks. Also, the experiments show that the proposed method achieves better performance in handling insufficient training data as well as getting rid of over-fitting problem.