This paper presents a novel codebook model for Chinese dialect identification. The method applies a semi-supervised approach to the vector quantization of dialect speech data, producing a codebook model that carries supervision information. This effectively addresses the problem of low codebook precision in Chinese dialect identification and substantially improves the system's recognition rate. Experimental results show that codebook quantization with supervision information clearly outperforms the traditional LBG vector quantization method: for three Chinese dialects, the identification rate reaches 94.23%, nearly 13% higher than that of a traditional codebook identification system.
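For orientation, the traditional baseline mentioned above (one LBG codebook per dialect, with an utterance assigned to the dialect whose codebook gives the lowest average quantization distortion) can be sketched roughly as follows. This is only an illustrative sketch of the conventional LBG approach, not the supervised codebook method proposed here, whose construction the abstract does not specify. The function names (`lbg`, `identify`, `toy_frames`), the codebook size, the split offset, and the synthetic 2-D "features" are all assumptions made for illustration; a real system would use acoustic feature vectors extracted from speech.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(vec, codebook):
    """Index of the codeword closest to vec."""
    return min(range(len(codebook)), key=lambda i: sq_dist(vec, codebook[i]))

def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def lbg(vectors, size, eps=0.01, iters=20):
    """Train a codebook by LBG splitting; size is assumed to be a power of two."""
    codebook = [mean(vectors)]
    while len(codebook) < size:
        # Split every codeword into two slightly perturbed copies.
        codebook = [[x + s * eps for x in cw] for cw in codebook for s in (1, -1)]
        # Refine with nearest-neighbour assignment and centroid updates.
        for _ in range(iters):
            cells = [[] for _ in codebook]
            for v in vectors:
                cells[nearest(v, codebook)].append(v)
            # An empty cell keeps its previous codeword.
            codebook = [mean(c) if c else cw for c, cw in zip(cells, codebook)]
    return codebook

def avg_distortion(vectors, codebook):
    """Mean squared distance from each vector to its nearest codeword."""
    return sum(sq_dist(v, codebook[nearest(v, codebook)]) for v in vectors) / len(vectors)

def identify(vectors, codebooks):
    """Return the label whose codebook quantizes the utterance with least distortion."""
    return min(codebooks, key=lambda label: avg_distortion(vectors, codebooks[label]))

def toy_frames(center, n, rng):
    """Hypothetical stand-in for speech features: Gaussian points around center."""
    return [(rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0)) for _ in range(n)]

# Toy data: three synthetic "dialects" with well-separated feature clusters.
rng = random.Random(0)
CENTERS = {"A": (0.0, 0.0), "B": (5.0, 5.0), "C": (-5.0, 5.0)}
codebooks = {name: lbg(toy_frames(c, 200, rng), size=4) for name, c in CENTERS.items()}

test_utterance = toy_frames(CENTERS["B"], 50, rng)
print(identify(test_utterance, codebooks))
```

In this baseline each codebook is trained only on its own dialect's data, so nothing pushes the codewords of different dialects apart; that limitation is the kind of codebook precision problem the supervision information is meant to address.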