提出了一种正交拉普拉斯语种识别方法,即在提取语音的i-vector后,采用正交局部保持投影进行子空间映射,将信号整体空间映射到语言信息加信道信息子空间,然后对映射后的矢量进行信道补偿处理,最后用支持向量机进行识别.尽管i-vector最大限度地保留了语音的声学信息,但是并没有发现这些信息之间的内在结构.利用正交局部保持投影在去除声学无关信息的基础上,进一步发现声学特征的内在结构,能够有效地提高特征的区分性.在对NISTLRE2003测试数据库实验后,发现新方法相较于基线系统来说,平均代价降低了28.91%.
An orthogonal Laplacian language recognition approach is proposed. In this approach, the i-vector of an utterance, after being extracted, is mapped into a subspace by an orthogonal locality preserving projection. Then, channel compensation is done for the mapped vector. At last, recognition is done with a support vector machine. Though the i-vector preserves the acoustics information as much as possible, it cannot find the inner structure among this information. Whereas the intrinsic structure of acoustics feature can be found by the orthogonal locality preserving projection algorithm on the basis of removing the irrelevant information. Experiments on the NIST LRE 2003 evaluation corpus show that this new approach can reduce a 28.91% average detection cost compared to the baseline.