东篱科研大数据发现系统（DRDS）

欢迎您！东篱公司退出

申报数据库
1. 申报指南
立项数据库
成果数据库
1. 期刊论文
2. 会议论文
3. 著作
4. 专利
项目获奖数据库

位置：成果数据库 > 期刊 > 期刊详情页

基于稀疏编码的鲁棒说话人识别

ISSN号：1004-9037
期刊名称：《数据采集与处理》
分类：TP391.42[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]河南理工大学计算机科学与技术学院,河南焦作454000, [2]中国科学院自动化研究所模式识别国家重点实验室,北京100190
相关基金：国家自然科学基金资助项目（91120303,90820303,90820011）;河南省基础与前沿技术研究计划资助项目（132300410332）

作者：何勇军[1], 孙广路[1], 付茂国[1], 韩纪庆[2]

关键词：语音识别, 随机段模型, 发音信息, 阶层式人工神经网路, 发音特征, speech recognition, stochastic segment model, articulatory information, hierarchical artificial neural network, articulatory feature

中文摘要：

提出了一种基于随机段模型的发音信息集成方法。根据随机段模型的模型特性，建立了阶层式人工神经网络来获取语音段信号属于各类音素的后验概率，并通过一遍解码的方式集成到随机段模型系统中。在“863-test”测试集上进行的汉语连续语音识别实验显示汉语字的相对错误率下降了5．93％。实验结果表明了将发音信息应用到随机段模型的可行性。

英文摘要：

This paper proposed a framework which attempted to incorporate articulatory information into the stochastic segment model based on Mandarin speech recognition system.According to the characteristics of the stochastic segment model,it used hierarchical artificial neural network to obtain the posterior probability of speech signal belonging to the phonemes.Then,it integrated the posterior probability into the stochastic segment model system in the first search process.Experiments conducted on “863-test”set show that about 5 .93% relative improvement could be achieved in the recognition accuracy.Thus,it de-monstrates the feasibility of the method.

同期刊论文项目

行车环境听觉模型及声音处理关键技术

期刊论文 41 会议论文 55 著作 2

基于稀疏编码的语音特征增强方法研究

期刊论文 3

复杂条件下交通标识图文检测、识别与理解

期刊论文 64 会议论文 27

同项目期刊论文

采用听觉滤波器的宽带MUSIC声源定位方法

汉语和英语音高重音自动标注法方法的对比分析

Integrating Binary Mask Estimation with MRF Priors of Cochleagram for Speech Separation

Statistical voice activity detection based on sparse representation over learned dictionary

基于噪音追踪的二值时频掩蔽到实值掩蔽的泛化算法

Auditory filter based broadband MUSIC algorithm for sound source localization

Optimization of Learned Dictionary for Sparse Coding in Speech Processing

A signal subspace dimension estimator based on F-norm with application to subspace-based multi-chann

The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio.

The analysis of the simplification from the ideal ratio to binary mask in signal-to-noise ratio sens

A new Bayesian method incorporating with local correlation for IBM estimation

Latent topic model for audio retrieval

A novel signal subspace dimension estimator based on F-norm with application to subspace-based multi

Sparse Representation with Optimized Learned Dictionary for Robust Voice Activity Detection

A new framework for robust speech recognition in complex channel environments

Noise Robust Direction of Arrival Estimation for Speech Source With Weighted Bispectrum Spatial Corr

Spectrum enhancement with sparse coding for robust speech recognition

Dictionary evaluation and optimization for sparse coding based speech processing

Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptati

Soft Margin Based Low-Rank Audio Signal Classification

行车噪声环境下基于人耳频率选择特性的声学特征提取方法

Identification of Objectionable Audio Segments Based on Pseudo and Heterogeneous Mixture Models

Fast Audio Retrieval Using Symbolized LSH Address Based on p-stable Distribution

基于p-稳定分布局部敏感哈希地址的鲁棒音频检索方法

融合引导概率的语音识别解码算法研究

SPARSE BASED AUDITORY MODEL FOR ROBUST SPEAKER RECOGNITION

Integrating Induced Probability into Decoding for Large Vocabulary Continuous Speech Recognition

基于深度学习语音分离技术的研究现状与进展

鲁棒声学事件检测综述

基于Fisher判别字典学习的说话人识别

Non-negative Matrix Factorization for Hyperspectral Unmixing Using Prior Knowledge of Spectral Signa

On the sample complexity of random fourier features for online learning: How many random fourier fea

Real-Time Traffic Light Detection With Adaptive Background Suppression Filter

Online Kernel Learning with Nearly Constant Support Vectors

Damping proximal coordinate descent algorithm for non-convex regularization

Image Deblurring with Coupled Dictionary Learning

Incremental Multiple Instance Outlier Detection

Audio Classical Composer Identification by Deep Neural Network

Sparse hyperspectralunmixing using an approximate L0 norm

Convex relaxation based sparse algorithm for hyperspectral target detection

Nonlocal Similarity Regularized Sparsity Model for Hyperspectral Target Detection

机器学习面临的挑战

Hyperspectral image fusion based on sparse constraint NMF

On the Sample Complexity of Random Fourier Features for Online Learning

Hinge Loss Stochastic Gradient Descent for Training Convolutional Neural Networks

Nonnegative matrix factorization-based hyperspectral and panchromatic image fusion

Regularized Simultaneous Forward-Backward Greedy Algorithm for Sparse Unmixing of Hyperspectral Data

A regularized non-Gaussianity based multiple-target detector for hyperspectral images

Hierarchical Interactions Model for Predicting Mild Cognitive Impairment (MCI) to Alzheimer'

Ship Detection in High-Resolution Optical Imagery Based on Anomaly Detector and Local Shape Feature

Non-negative Matrix Factorization for HyperspectralUnmixing Using Prior Knowledge of Spectral Signat

Vehicle detection in remote sensing imagery based on salient information and local shape feature

A novel unsupervised approach to discovering regions of interest in traffic images

Scene Learning for Cloud Detection on Remote-Sensing Images

Hierarchical Interactions Model for Predicting Mild Cognitive Impairment (MCI) to Alzheimer's D

Relaxed sparse eigenvalue conditions for sparse&nbs

Large-Scale Eigenvector Approximation via Hilbert Space Embedding Nystrom

Solving one-class problem with outlier examples by SVM

Regularized Tree Partitioning and Its Application to Unsupervised Image Segmentation

Robust Hyperspectral Image Target Detection Using An Inequality Constraint

Sparse Unmixing of Hyperspectral Data Using Spectral a Priori Information

Multi-scale retinex improvement for nighttime image enhancement

SparseCEM and SparseACE for Hyperspectral Image Target Detection

Airplane detection based on rotation invariant and sparse coding in remote sensing images

Single image dehazing in inhomogeneous atmosphere

An improved-SFIM fusion method based on the calibration process

Hyperspectral image fusion by multiplication of spectral constraint and NMF

An inner-product based discriminative IRLS algorithm for sparse hyperspectral detection

An automated airplane detection system for large panchromatic image with high spatial resolution

Subspace Matching Pursuit for Sparse Unmixing of Hyperspectral Data

Multiple rank multi-linear SVM for matrix data classification[J]

Multi-Stage Multi-Task Feature Learning

Hyperspectral and panchromatic image fusion using unmixing-based constrained nonnegative matrix fact

Efficient sparse unmixing analysis for hyperspectral imagery based on random projection

Large-scale eigenvector approximation via Hilbert Space Embedding Nyström

Single remote sensing image dehazing

Nonnegative matrix factorization based hyperspectral and panchromatic image fusion

基于sc- NMF的高光谱图像融合

数据不充分情况下的说话人识别

基于噪声分类和字典选择的语音活动检测

期刊信息

《数据采集与处理》
北大核心期刊（2011版）

主管单位:中国科学技术协会
主办单位:中国电子学会仪器仪表学会信号处理学会中国一汽仪表学会中国物理学会微弱信号检测学会南京航空航天大学
主编：贲德
地址：南京市御道街29号
邮编：210016
邮箱：sjcj@nuaa.edu.cn
电话：025-84892742

国际标准刊号：ISSN：1004-9037
国内统一刊号：ISSN：32-1367/TN
邮发代号:28-235

获奖情况:
中国科技论文统计源用刊,2007年被评为江苏省优秀期刊

国内外数据库收录:
俄罗斯文摘杂志,荷兰文摘与引文数据库,美国剑桥科学文摘,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）

被引量:8148