东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于听觉模型与自适应分数阶Fourier变换的声学特征在语音识别中的应用

ISSN号：0371-0025
期刊名称：声学学报(中文版)
时间：2012.1.1
页码：97-103
分类：TN912.34[电子电信—通信与信息系统;电子电信—信息与通信工程]
作者机构：[1]北京理工大学信息与电子学院,北京100081
相关基金：国家科技重大专项课题（2010ZX03004-003-01）、国家自然科学基金（90920304）和教育部博士点基金（20101101110020）、国家自然科学基金（60605015）和2009年深圳市南山区科技研发资金资助项目.
相关项目：无人驾驶车辆智能行为综合测试环境设计与测评体系研究

关键词：分数阶FOURIER变换, 自动语音识别系统, 声学特征, 听觉模型, CHIRP信号, 自适应, 应用, 瞬时频率, Audition, Signal processing

中文摘要：

分数阶Fourier变换在处理非平稳信号尤其是chirp信号方面有着独特的优势，而人耳听觉系统具有自动语音识别系统难以比拟的优良性能。本文采用Gammatone听觉滤波器组对语音信号进行前端时域滤波，然后对输出的各个子带信号用分数阶Fourier变换方法提取声学特征。分数阶Fourier变换的阶数对其性能有着重要影响，本文针对子带时域信号提出了采用瞬时频率曲线拟合求取阶数的方法，并将其与采用模糊函数的方法作了比较。在干净与含噪汉语孤立数字库上的语音识别结果表明，采用新提出的声学特征得到的识别正确率相对MFCC基线系统有了显著提高；根据瞬时频率曲线搜索阶数的算法与模糊函数方法相比，计算量大大减少，并且根据该方法提取的声学特征得到了最高的平均识别正确率。

英文摘要：

It is well known that auditory system of human beings has excellent performance with which automatic speech recognition （ASR） systems can＇t match, and fractional Fourier transform （FrFT） has unique advantages in nonstationary signal processing. In this paper, the Gammatone filterbank is applied to speech signals for front-end temporal filtering, and then acoustic features of the output subband signals are extracted based on fractional Fourier transform. The transform order is critical for FrFT. An order adaptation method based on the instantaneous frequency is proposed, and its performance is compared with the method based on ambiguity function. ASR experiments are conducted on clean and noisy Mandarin digits, and the results show that the proposed features achieve significantly higher recognition rate than the MFCC baseline, and the order adaptation method based on instantaneous frequency has much lower complexity than that based on ambiguity function. Further more, the FrFT-based features achieve the highest recognition rate using the proposed order adaptation method.

同期刊论文项目

无人驾驶车辆智能行为综合测试环境设计与测评体系研究

期刊论文 24 会议论文 36 著作 1

基于分数阶傅立叶变换的语音信号分析与应用研究

期刊论文 8 会议论文 8 专利 2

同项目期刊论文

基于 Fuzzy-EAHP 的无人驾驶车辆智能行为评价

Design and Implementation of a Miniature Intelligent Vehicle Test Platform

The Fuzzy-AHP Evaluation Method for Unmanned Ground Vehicles

A Head-Up Display-Based P300 Brain-Computer Interface for Destination Selection

Test and Evaluation of Autonomous Ground Vehicles

智能车辆路径跟踪横向控制方法的研究

智能车辆视觉里程计算法研究进展

基于CarSim和Matlab的智能车辆视觉里程计仿真平台设计

Variable dimensional state space based global path planning for mobile robot

Detection and Tracking of Moving Objects at Intersections Using a Network of Laser Scanners

An iterative linear quadratic regulator based trajectory tracking controller for wheeled mobile robo

基于变维度状态空间的增量启发式路径规划方法研究

Queuing Network Modeling of Driver Lateral Control With or Without a Cognitive Distraction Task

基于混沌理论的无人驾驶车辆行驶轨迹量化分析

基于Zig Bee的无线扭矩测量系统研究

并线工况下车载雷达有效目标快速检测方法

基于Fuzzy-EAHP的无人驾驶车辆智能行为评价

不确定和时滞扰动下的车道保持自校正滑模控制

自动转向滑模变结构控制参数选取方法

An iterative linear quadratic regulator based trajectory tracking controller for wheeled mobile robot

耳语数据库的设计与采集

耳语音数据库的设计与采集

Improved Spectral Representation for Birdcall Based on Fractional Fourier Transform

Pitch- and formant-based order adaptation of the fractional Fourier transform and its application to

Hadamard纠错码结合支持向量机在多分类问题中的应用

Improved Spectral Representation for Birdcall Based on Fractional Fourier Transform

Design and implementation of scenic-spot introduction-task-oriented 3D virtual human spoken dialogue system

基于改进DBSCAN算法的激光雷达车辆探测方法

期刊信息

《声学学报》
中国科技核心期刊

主管单位:中国科学院
主办单位:中国科学院声学研究所
主编：王小民
地址：北京北四环西路21号
邮编：100190
邮箱：
电话：010-62558329

国际标准刊号：ISSN：0371-0025
国内统一刊号：ISSN：11-2065/O4
邮发代号:2-181

获奖情况:
中国期刊方阵“双效”期刊

国内外数据库收录:
荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,日本日本科学技术振兴机构数据库,美国应用力学评论,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:8376