Nonnative Speech Recognition Based on a Mixed-Model State Modification Algorithm

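Journal: 数字通信 (Digital Communication)
Date: 0
Pages: 33-37
Language: Chinese
Classification: TP391.4 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology] TS190.642 [Light Industry Technology and Engineering: Textile Chemistry and Dyeing and Finishing Engineering; Light Industry Technology and Engineering: Textile Science and Engineering]
Author affiliation: [1] Institute of Acoustics, Chinese Academy of Sciences, Beijing 100080
Funding: National High Technology Research and Development Program of China (863 Program, 2006AA010102, 2006AA01Z195); National Basic Research Program of China (973 Program, 2004CB318106); National Natural Science Foundation of China (Nos. 10574140, 60535030)
Related project: Research on Automatic Assessment and Correction Methods for Stuttered Speech

Authors: 张晴晴 (Zhang Qingqing), 颜永红 (Yan Yonghong), 潘接林 (Pan Jielin)

Keywords: nonnative speech recognition, model modification, mixed model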

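Abstract (translated from Chinese):

Recognition performance for nonnative speech is low, and the drop is most pronounced for speakers who have just begun learning the target language or who have heavy accents. This paper proposes a new bilingual model modification algorithm to improve recognition of nonnative speech. In this algorithm, each state of the baseline acoustic model is modified by states of an auxiliary model that represents the characteristics of the speaker's mother tongue. The paper gives the state modification criterion and compares performance for different numbers of candidate modification states. Compared with a baseline acoustic model already adapted with nonnative training data, the acoustic model modified by the bilingual model achieves a relative reduction of 11.7% in phrase error rate while preserving the real-time factor of recognition.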

English abstract:

The performance of automatic speech recognition degrades drastically for nonnative speakers, especially those who are just beginning to learn a foreign language or who have heavy accents. A novel bilingual model modification approach is presented to improve nonnative speech recognition accuracy. Each state of the baseline nonnative acoustic model is modified with several candidate states from an auxiliary acoustic model trained on the speakers' mother tongue. The state mapping criterion and the number of n-best candidates are investigated. Compared with a nonnative acoustic model that has already been well trained with MAP adaptation, the bilingual model modification approach further reduces the phrase error rate by a relative 11.7%, with only a small relative increase in the real-time factor.
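
The 11.7% figure in the abstract is a relative reduction, not an absolute one. The sketch below only illustrates that arithmetic. The baseline phrase error rate of 20% is a hypothetical value chosen for the example and is not reported in the abstract; the function names are likewise illustrative.

```python
def relative_reduction(baseline_error: float, new_error: float) -> float:
    """Return the fraction of baseline errors that were removed."""
    if baseline_error <= 0:
        raise ValueError("baseline_error must be positive")
    return (baseline_error - new_error) / baseline_error


def error_after_relative_reduction(baseline_error: float, reduction: float) -> float:
    """Return the error rate left after removing a fraction of baseline errors."""
    return baseline_error * (1.0 - reduction)


# Hypothetical baseline phrase error rate (assumption, not from the paper).
baseline_per = 0.20
# Relative reduction stated in the abstract.
reported_reduction = 0.117

new_per = error_after_relative_reduction(baseline_per, reported_reduction)
print(f"new phrase error rate: {new_per:.4f}")
print(f"absolute drop: {baseline_per - new_per:.4f}")
```

Under this assumed baseline, an 11.7% relative reduction corresponds to an absolute drop of a little over two percentage points, which is why relative and absolute figures should not be compared directly.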

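Projects associated with this journal paper

Research on Key Technologies for Multilingual Spontaneous Spoken Dialogue Systems

Journal papers: 3

Research on Automatic Assessment and Correction Methods for Stuttered Speech

Journal papers: 18; conference papers: 14

Research on the Mechanisms and Applications of Auditory Perception Differences Between Chinese and English

Journal papers: 88; conference papers: 112; books: 1

Research on Noninvasive Objective Voice Quality Assessment Methods Based on Acoustic Analysis

Journal papers: 29; conference papers: 9; books: 1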

Journal papers from the same project

Adaptive Null-Forming Algorithm Based on Auditory Perceptual Subbands

Speech endpoint detection in real noise environments.

A Sample Database of Mandarin Voice Disorders

Automatic language identification with discriminative language characterization based on SVM

Robust Speaker Clustering Using Affinity Propagation

Robust Glottal Source Estimation Based on Joint Source-filter Model Optimization

Effect of the temporal fine structure in different frequency bands on Mandarin tone perception

An improved cochlear implant strategy incorporating frequency modulation information

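Detection and Restoration of Clipping Distortion in Audio Signals

Analysis of Speech Intelligibility and Voice Acoustic Parameters in Prelingually Deaf Adolescents with Cochlear Implants.

Pathogenesis and Treatment Principles of Vocal Fold Granuloma.

Current Status of Research on Subjective Auditory-Perceptual Evaluation of Voice Disorders.

Glottal Source Model Estimation Method for Speech Based on Joint Source-Filter Model Optimization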

Speech Enhancement Using Compact Microphone Array and Applications in Distant Speech Acquisition

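Analysis of Speech Intelligibility and Voice Acoustic Parameters in Prelingually Deaf Adolescents with Cochlear Implants

Pathogenesis and Treatment Principles of Vocal Process Granuloma

Detection and Restoration of Clipping Distortion in Audio Signals

Speaker Clustering Algorithm Using Cross Log-Likelihood and the Bayesian Information Criterion

Automatic Voice Assessment Method Based on a Human Auditory Model

A Study of Tone Recognition in Spontaneous Mandarin Speech

Research on Mandarin-English Bilingual Mixed Speech Recognition

Speech Endpoint Detection Based on Energy and Voicing Characteristics

A Glottal Source Model Estimation Method for Speech Based on Joint Source-Filter Model Optimization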

Cochlear implant signal processing algorithm based on frequency modulation

Development of a Mandarin-English Bilingual Speech Recognition System with Unified Acoustic Models

An Efficient Graph Search Algorithm for Embedded Speech Recognition

Speech Enhancement Using Improved Adaptive Null-Forming in Frequency Domain with Postfilter

Automatic Voice Assessment Method Based on a Human Auditory Model.

Speech Endpoint Detection in Complex Noise Environments

An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns

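Application of Minimum Variance Distortionless Response Perceptual Cepstral Coefficients to Speaker Recognition

Acoustic Analysis of Stop Consonants in the Fluent Reading of Adult Stutterers

Research on a Multi-Feature Fusion Automatic Scoring System for Spoken English Tests

Dimension-Wise Analysis and Subset Optimization of Articulatory Features for Speech Recognition Confidence Measures

Application of Joint Factor Analysis and Sparse Representation to Robust Speaker Verification

Algorithm Research and Implementation of an Automatic Subtitle Generation System Based on Online Speech Streams

Automatic Scoring of English Passage Reading Quality

Digital Audio Watermarking Algorithm for Copyright Management

Research Progress in Speech Acoustics and Content Understanding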

Acoustic characteristics of stop consonants in fluent reading Chinese Putonghua speech of adult stutterers

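Research Progress on Deafness Caused by Bacterial Meningitis

The Role of the Lateral Amygdala in Fear Conditioning Learning and Memory

Implementation and Improvement of a Network-Based Unsupervised MLLR Adaptation Algorithm

Optimal Time-Reversal Focusing by an Iterative Least Squares Method

Perceptual integration between target speech and target-speech reflection reduces masking for target

Both frequency and interaural delay affect ERP responses to binaural gap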

An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference

Automatic Singing Performance Evaluation for Untrained Singers

A two-element-microphone-array-based speech recognition system in vehicle environment

Discrimination between pathological and normal voices using GMM-SVM approach

Research on a Cochlear Implant Speech Processing Algorithm Based on Frequency Modulation Information

Advances in Speech Acoustics and Their Applications

Precedence effect-induced enhancement of prepulse inhibition in socially reared but not isolation-re

Auditory frequency-following responses in rat ipsilateral inferior colliculus

The dual-pathway model of auditory signal processing

A robust real-time decoder using memory-efficient state network

Using SVM as Back-End Classifier for Language Identification

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

Melody Track Selection Using Discriminative Language Model

The effect of voice cuing on releasing Chinese speech from informational masking

The experimental investigation on the reproduction of audible sound from two ultrasonic beams

Chronic administration of clozapine alleviates reversal learning impairment in isolation reared rats

A Fast Calculation Method for the Scattered Sound Fields from Two Sound Beams Using the Gaussian-Bea

Generation of Audible Sound from Two Ultrasonic Beams with Dummy Head

Speech Enhancement Using Compact Microphone Array and Applications in Distant Speech Acquisition

Development of a Mandarin-English Bilingual Speech Recognition System with Unified Acoustic Models

Estimation of ICTD in Frequency Sub-bands Based on NDFT

Development of a Mandarin-English Bilingual Speech Recognition System for Real World Music Retrieval

The Unmasking Effect of Perceptual Cues in a "Cocktail Party" Environment

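Fast fuzzy keyword spotting using syllable confusion network

An Improved Automatic Speaker Clustering Algorithm Based on Hierarchical Clustering

A Narrowband Self-Synchronizing Audio Watermarking Algorithm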

Ultrasonic Intruder Detection System for Home Security

Orthogonal Relief Algorithm for Feature Selection

The influence of the perceptual or fear learning on rats’ prepulse inhibition induced by changes in

Distance-dependent Head-related Transfer Functions Measured with High Spatial Resolution Using a Spa

Emotional learning enhances stimulus-specific top-down modulation of sensorimotor gating in socially

Top-down modulation of prepulse inhibition of the startle reflex in humans and rats

Detection of the break in interaural correlation is affected by interaural delay, aging, and center

Product HMM based training method for acoustic model with multiple size units

Target-Oriented Acoustic Radiation Generation Technique for Sound Field Control

The investigation of localized sound generation using two ultrasound beams

Speech endpoint detection in real noise environments

Auditory fear conditioning modulates prepulse inhibition in socially-reared rats and isolation-reare

Rapid Adaptation Based on Regression Analysis

Speech Endpoint Detection in Complex Noise Environments

A Task-Domain-Independent Spoken Keyword Spotting System

Metabotropic glutamate subtype 5 receptors modulate fear-conditioning induced enhancement of prepuls

Auditory evoked responses in the rat: transverse subdermal electrodes register before cochlear nucle

Ultrasonic auditory evoked response recorded in the rat’s cochlear nucleus

Human Face Classification Using Ultrasonic Sonar Imaging

Binaural unmasking of frequency-following responses in rat amygdala

Product HMM-based training method for acoustic model with multiple-size units

The First International Symposium on Neurobehavioral Science in China (Editorial)

Simulated phase-locking stimulation: an improved speech processing strategy for cochlear implant

Using A Kind of Novel Phonotactic Information for SVM based Speaker Recognition

Metabotropic glutamate receptors subtype 5 are necessary for the enhancement of auditory evoked pote

Rapid calculations of the scattered sound fields generated by two sound beams

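A Short Text Classification Method Based on Concept Networks

Auditory Perception Differences Between Chinese and English and a Cochlear Implant Coding Strategy Suited to Chinese

A Comparative Study of Smoothing Algorithms for Domain-Specific Chinese Language Models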
