位置:成果数据库 > 期刊 > 期刊详情页
Speaker-independent speech emotion recognition by fusion of functional and accompanying paralanguage features
  • ISSN号:1006-8961
  • 期刊名称:《中国图象图形学报》
  • 时间:0
  • 分类:TP391.4[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]Department of Computer Science and Communication Engineering, Jiangsu University, China Zhenjiang 212013
  • 相关基金:Project supported by the National Natural Science Foundation of China (Nos. 61272211 and 61170126), the Natural Science Founda- tion of Jiangsu Province (No. BK2011521), and the Research Foun- dation for Talented Scholars of Jiangsu University (No. 10JDG065), China
中文摘要:

Functional paralanguage includes considerable emotion information, and it is insensitive to speaker changes. To improve the emotion recognition accuracy under the condition of speaker-independence, a fusion method combining the functional paralanguage features with the accompanying paralanguage features is proposed for the speaker-independent speech emotion recognition. Using this method, the functional paralanguages, such as laughter, cry, and sigh, are used to assist speech emotion recognition. The contributions of our work are threefold. First, one emotional speech database including six kinds of functional paralanguage and six typical emotions were recorded by our research group. Second, the functional paralanguage is put forward to recognize the speech emotions combined with the accompanying paralanguage features. Third, a fusion algorithm based on confidences and probabilities is proposed to combine the functional paralanguage features with the accompanying paralanguage features for speech emotion recognition. We evaluate the usefulness of the functional paralanguage features and the fusion algorithm in terms of precision, recall, and F1-measurement on the emotional speech database recorded by our research group. The overall recognition accuracy achieved for six emotions is over 67% in the speaker-independent condition using the functional paralanguage features.

英文摘要:

Functional paralanguage includes considerable emotion information, and it is insensitive to speaker changes. To improve the emotion recognition accuracy under the condition of speaker-independence, a fusion method combining the func- tional paralanguage features with the accompanying paralanguage features is proposed for the speaker-independent speech emo- tion recognition. Using this method, the functional paralanguages, such as laughter, cry, and sigh, are used to assist speech emotion recognition. The contributions of our work are threefold. First, one emotional speech database including six kinds of functional paralanguage and six typical emotions were recorded by our research group. Second, the functional paralanguage is put forward to recognize the speech emotions combined with the accompanying paralanguage features. Third, a fusion algorithm based on con- fidences and probabilities is proposed to combine the functional paralanguage features with the accompanying paralanguage features for speech emotion recognition. We evaluate the usefulness of the functional paralanguage features and the fusion algo- rithm in terms of precision, recall, and F 1-measurement on the emotional speech database recorded by our research group. The overall recognition accuracy achieved for six emotions is over 67% in the speaker-independent condition using the functional paralanguage features.

同期刊论文项目
同项目期刊论文
期刊信息
  • 《数码影像》
  • 主管单位:
  • 主办单位:中国图象图形学学会 中科院遥感所 北京应用物理与计算数学研究所
  • 主编:
  • 地址:北京市海淀区花园路6号
  • 邮编:100088
  • 邮箱:
  • 电话:010-86211360 62378784
  • 国际标准刊号:ISSN:1006-8961
  • 国内统一刊号:ISSN:11-3758/TB
  • 邮发代号:
  • 获奖情况:
  • 国内外数据库收录:
  • 被引量:0