传统的MFCC及短时能量特征只反映了信号序列的静态特征,目前基于这些特征的语音/音乐识别率为79%~86%。样本熵可以反映信号序列中的新信息量的大小以及新信息量的变化程度。以样本熵作为特征对语音/音乐进行分类识别,提取混合信号的样本熵,计算每段信号样本熵的均值和方差,采用k均值聚类进行识别。仿真实验结果表明,基于样本熵的语音/音乐识别的识别率可提高到88.073%。
Mel frequency cepstral coefficients and short time energy only reflect the static characteristics in signal sequence and the recognition rate of speech/music discrimination is 79%~86%.Sample entropy reflects the size and variational extent of new information in signal sequence.This paper conducts speech/music discrimination using sample entropy.The mean and variance of the sample entropy are calculated after extracting the sample entropy of mixed signals,then each signal is recognized by k-means cluster.Simulation experimental results show that the recognition rate of speech/music discrimination reaches 88.073% when using sample entropy.