东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

语音反演远端监督学习模型研究

ISSN号：1005-3751
期刊名称：计算机技术与发展
时间：2013.3.3
页码：105-108
分类：TP181[自动化与计算机技术—控制科学与工程;自动化与计算机技术—控制理论与控制工程]
作者机构：[1]南京邮电大学计算机学院,江苏南京210003
相关基金：国家自然科学基金资助项目（61073115）
相关项目：基于DIVA模型的机器人语音生成与获取小脑控制模型的研究

作者：陈英|张少白|

关键词：发音信息, 语音反演, 远端监督学习, 声道变量, articulatory information , speech inversion , distal supervised learning （DSL） , tract variables

中文摘要：

针对发音信息在话音环境中并不容易得到的问题，提出了一种从听觉信号中预测发音信息的语音反演方法。论文应用远端监督学习（DSL），对语音反演机器学习策略进行研究，并对其实验背景和理论依据进行了分析。论文在提出一种对远端监督学习逆模进行全局优化的方法的同时，通过应用八个声道变量作为发音信息来模拟语音动力学，对语音信号分别被参数化为声学参数（APs）和梅尔频率倒谱系数（MFCCs）时的预测结果进行了比较。结果表明远端监督学习对声道变量有较好的预测性能。

英文摘要：

To the problem that articulatory information is not readily available in typical speakerlistener situations, a method that esti mates articulatory information from the acoustic signal is proposed, namely speech inversion. It selectes distal supervised learning （DSL） as one of machine learning strategies for speech inversion to study, and analyzes the experiment＇s background and theoretical foundation of distal supervised learning. It proposes that use a global optimization approach for the inverse model of distal supervised teaming and eight tract variables as articulatory information to simulate speech dynamics, the results when speech signal is parameterized as acoustic parameters （APs） and as melfrequency cepstral coefficients （MFCCs） are compared in the paper. The results show that distal super vised learning has a good estimation performance for tract variables.

同期刊论文项目