To meet the needs of emotional speech recognition and synthesis, a database for speech emotion computing is designed and built. It covers four emotions (neutral, sad, happy and angry) with 1 000 sentences each, for 4 000 emotional utterances in total. First, the text corpus is screened with a greedy algorithm, and the neutral speech is obtained by recording and segmenting the selected sentences. Next, the neutral speech is modified according to the prosodic feature patterns of emotional speech to obtain the sad, happy and angry speech. Finally, an improved fuzzy comprehensive evaluation method is applied to assess the speech data on six aspects: emotional expression, clarity, fluency, scene sense, naturalness and noise effect, which objectively and accurately verifies the reliability of the corpus. The database provides an important application foundation and prerequisite for speech emotion computing.
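As an illustration of the evaluation step, the following is a minimal sketch of the standard weighted-average fuzzy comprehensive evaluation over the six aspects named above. It is not the improved method of this work, whose details are not given here. The grade labels, the weights, the listener vote counts and all function names are hypothetical and chosen only for the example.

```python
# Sketch of the STANDARD fuzzy comprehensive evaluation (weighted-average
# operator). The "improved" variant used in the paper is not reproduced here.

# The six aspects come from the abstract; the grade set is an assumption.
CRITERIA = ["emotional expression", "clarity", "fluency",
            "scene sense", "naturalness", "noise effect"]
GRADES = ["excellent", "good", "fair", "poor"]  # hypothetical grade labels


def membership_matrix(votes):
    """Turn vote counts into membership degrees.

    votes[i][j] is the number of listeners who gave criterion i the grade j.
    Each row is normalised so that it sums to 1.
    """
    matrix = []
    for row in votes:
        if len(row) != len(GRADES):
            raise ValueError("each criterion needs one count per grade")
        total = sum(row)
        if total == 0:
            raise ValueError("a criterion received no votes")
        matrix.append([count / total for count in row])
    return matrix


def evaluate(weights, votes):
    """Return the evaluation vector B = W . R, one value per grade."""
    if len(weights) != len(votes):
        raise ValueError("one weight per criterion is required")
    weight_sum = sum(weights)
    w = [x / weight_sum for x in weights]      # normalised weight vector W
    r = membership_matrix(votes)               # membership matrix R
    return [sum(w[i] * r[i][j] for i in range(len(w)))
            for j in range(len(GRADES))]


def best_grade(b):
    """Pick the grade with the largest membership (maximum-membership rule)."""
    return GRADES[max(range(len(b)), key=b.__getitem__)]


if __name__ == "__main__":
    # Hypothetical weights and votes from 10 listeners for one utterance.
    weights = [0.3, 0.15, 0.15, 0.1, 0.2, 0.1]
    votes = [
        [6, 3, 1, 0],
        [7, 2, 1, 0],
        [5, 4, 1, 0],
        [4, 4, 2, 0],
        [5, 3, 2, 0],
        [8, 2, 0, 0],
    ]
    b = evaluate(weights, votes)
    print([round(x, 3) for x in b], best_grade(b))
```

The weighted-average operator is used here because it lets every aspect contribute to the final vector; other composition operators (for example max-min) are equally common and could be substituted.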