为了解决语音情感识别系统中训练数据和测试数据来自不同数据库所引起的识别率降低的问题,提出了一种基于稀疏特征迁移的语音情感识别方法。通过引入稀疏编码获取情感特征在不同数据库条件下的共同稀疏表示;同时引入最大区分差异(Maximum mean discrepancy ,MMD)来衡量不同数据库条件下稀疏表示分布之间的距离,并将其作为稀疏编码目标函数的约束条件,从而获得较为鲁棒的稀疏特征。实验结果表明,相比传统语音情感识别方法,基于稀疏特征迁移的语音情感识别方法显著提高了跨库条件下的情感识别率。
In speech emotion recognition system ,recognition rates will drop drastically when the training and the testing utterances are from different corpora .To solve this problem ,a novel sparse feature trans‐fer approach is proposed .By employing sparse coding algorithm ,the common sparse feature representa‐tion of emotion features from different corpora is obtained .Meanwhile ,the maximum mean discrepancy (MMD) algorithm is introduced to measure the distance between different distributions ,and is used as the regularization term for the objective function of sparse coding .Finally ,the robust sparse features are achieved for recognition .Experimental results show that ,compared to traditional methods ,the proposed approach can significantly improve the recognition rates for cross databases .