位置:成果数据库 > 期刊 > 期刊详情页
a modified voice conversion algorithm using compressed sensing
  • ISSN号:0217-9776
  • 期刊名称:声学学报(英文版)
  • 时间:2014
  • 页码:323-333
  • 分类:TP311.13[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术] TN912.3[电子电信—通信与信息系统;电子电信—信息与通信工程]
  • 作者机构:[1]School of Communication Engineering, Hangzhou DianZi University Hangzhou 310018, [2]Inst. of Electronic and Information Engineering, Shanghai Univ. of Electric Power Shanghai 200090
  • 相关基金:supported by the National Natural Science Foundation of China(61201301); Program of Zhejiang Provincial Education Department(Y201016542)
  • 相关项目:用于非对称语料的语音转换函数训练算法研究
作者: 简志华|
中文摘要:

A voice conversion algorithm,which makes use of the information between continuous frames of speech by compressed sensing,is proposed in this paper.According to the sparsity property of the concatenated vector of several continuous Linear Spectrum Pairs(LSP)in the discrete cosine transformation domain,this paper utilizes compressed sensing to extract the compressed vector from the concatenated LSPs and uses it as the feature vector to train the conversion function.The results of evaluations demonstrate that the performance of this approach can averagely improve 3.21%with the conventional algorithm based on weighted frequency warping when choosing the appropriate numbers of speech frame.The experimental results also illustrate that the performance of voice conversion system can be improved by taking full advantage of the inter-frame information,because those information can make the converted speech remain the more stable acoustic properties which is inherent in inter-frames.

英文摘要:

A voice conversion algorithm,which makes use of the information between continuous frames of speech by compressed sensing,is proposed in this paper.According to the sparsity property of the concatenated vector of several continuous Linear Spectrum Pairs(LSP)in the discrete cosine transformation domain,this paper utilizes compressed sensing to extract the compressed vector from the concatenated LSPs and uses it as the feature vector to train the conversion function.The results of evaluations demonstrate that the performance of this approach can averagely improve 3.21%with the conventional algorithm based on weighted frequency warping when choosing the appropriate numbers of speech frame.The experimental results also illustrate that the performance of voice conversion system can be improved by taking full advantage of the inter-frame information,because those information can make the converted speech remain the more stable acoustic properties which is inherent in inter-frames.

同期刊论文项目
同项目期刊论文
期刊信息
  • 《声学学报:英文版》
  • 主管单位:
  • 主办单位:中国科学院声学所 中国声学会
  • 主编:
  • 地址:北京北四环西路21号
  • 邮编:100080
  • 邮箱:jsx@mail.ioa.ac.cn
  • 电话:010-62558329
  • 国际标准刊号:ISSN:0217-9776
  • 国内统一刊号:ISSN:11-2066/O3
  • 邮发代号:
  • 获奖情况:
  • 国内外数据库收录:
  • 美国应用力学评论
  • 被引量:47