Identifying Prosodic Phrase Boundaries Using Binary Tree Pruning

Journal: Journal of Chinese Information Processing (中文信息学报), 2006, Issue 3
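Classification: TP391 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
Author affiliations: [1] Institute of Language Information Processing, Beijing Language and Culture University, Beijing 100083; [2] Fujitsu Research and Development Center, Beijing 100016
Funding: National Natural Science Foundation of China (60573184)
Related project: Research on Computer-Based Automatic Assessment of Non-Native Writing Proficiency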

Keywords: artificial intelligence, natural language processing, statistical language model, binary tree, prosodic phrase, maximum entropy model

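Chinese abstract (in translation):

Prosodic phrase identification is an important research topic in speech synthesis. This paper proposes using a binary tree generated by a statistical language model, combined with the maximum entropy method, to identify the pause points in Chinese sentences to be synthesized. The paper presents the model training and generation algorithms for the binary tree, the relationship between the binary tree and pause points, and the use of binary tree pruning within the maximum entropy method to identify the prosodic phrases of a sentence. Experimental results show that pruning the search with the binary tree substantially improves both the precision and the recall of pause prediction; on the experimental data, the F-score improved by nearly 35%.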
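Precision, recall, and F-score for pause prediction can be computed as in the short sketch below. It assumes pause points are represented as sets of word-boundary indices and that the balanced F-score (F1) is meant; the function name `precision_recall_f1` is illustrative and does not come from the paper. The abstract does not say whether the gain of nearly 35% is absolute or relative, so the sketch only shows how a single score is obtained.

```python
def precision_recall_f1(predicted, gold):
    """Score predicted pause points against reference pause points.

    Both arguments are collections of word-boundary indices
    (an assumed representation, not taken from the paper).
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)

    # Guard against empty sets to avoid division by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0

    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data: 2 of 3 predicted pauses are correct, 2 of 4 reference pauses are found.
p, r, f = precision_recall_f1({2, 5, 7}, {2, 5, 9, 11})
print(round(p, 3), round(r, 3), round(f, 3))  # 0.667 0.5 0.571
```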

English abstract:

It is important to recognize prosodic phrase breaks in text-to-speech synthesis. In this paper, a new method is introduced for this purpose, which uses a binary tree as a pruning strategy in the Maximum Entropy Model (MaxEnt) framework. First, the concept of a binary tree generated from a statistical language model is given. Then the process of generating the binary tree is discussed. When MaxEnt is applied to search for the optimal prosodic phrases, the binary tree is exploited to narrow the search space and improve performance. Experimental results show that the F-score for predicting prosodic phrase breaks is about 35% better than that of the previous system, in which the binary tree strategy is not adopted.
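The abstract does not spell out how the binary tree is built or how it prunes the search, so the following is only an illustrative sketch under stated assumptions, not the authors' algorithm. It assumes that (a) a language-model-derived association score is available for each pair of adjacent words, (b) the tree is built top-down by splitting each span at its weakest boundary, and (c) pruning means keeping only boundaries near the root as candidate pause points for a downstream classifier such as MaxEnt. The names `boundary_depths`, `candidate_breaks`, and the toy score table are all hypothetical.

```python
def boundary_depths(words, score, lo=0, hi=None, depth=1, out=None):
    """Recursively split words[lo:hi] at its weakest internal boundary.

    Boundary i lies between words[i] and words[i + 1]. `score(a, b)` is an
    assumed association score for adjacent words (higher = more tightly bound).
    Returns a dict mapping each boundary index to the tree depth at which
    it was chosen as a split point (1 = root split).
    """
    if hi is None:
        hi = len(words)
    if out is None:
        out = {}
    if hi - lo < 2:
        return out  # a single word has no internal boundary

    # Pick the boundary whose neighbouring words are least associated.
    split = min(range(lo, hi - 1), key=lambda i: score(words[i], words[i + 1]))
    out[split] = depth

    # Build the left and right subtrees.
    boundary_depths(words, score, lo, split + 1, depth + 1, out)
    boundary_depths(words, score, split + 1, hi, depth + 1, out)
    return out


def candidate_breaks(words, score, max_depth):
    """Prune the search space: keep only boundaries split at depth <= max_depth."""
    depths = boundary_depths(words, score)
    return sorted(i for i, d in depths.items() if d <= max_depth)


# Toy association scores for a five-token sentence (made-up values).
toy_scores = {("a", "b"): 0.9, ("b", "c"): 0.2, ("c", "d"): 0.8, ("d", "e"): 0.5}
words = ["a", "b", "c", "d", "e"]
print(candidate_breaks(words, lambda x, y: toy_scores[(x, y)], max_depth=2))  # [0, 1, 3]
```

In this toy run, boundary 2 (between "c" and "d") sits deepest in the tree and is pruned, so a classifier would only need to score boundaries 0, 1, and 3. How the paper actually maps tree structure to candidate pauses may differ.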

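Project associated with this paper

Research on Computer-Based Automatic Assessment of Non-Native Writing Proficiency

Journal papers: 5; conference papers: 4

Journal papers from the same project

A Study of Feature Selection for Automated HSK Essay Scoring

Identifying Pauses Between Chinese Phrases Based on a Maximum Entropy Model

Speech Pause Prediction Based on Punctuation Information and a Statistical Language Model

Chinese Prosodic Phrase Boundary Identification Based on Classification and Regression Trees (CART)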