Human Mouth-State Recognition Based on Image Warping and Sparse Representation Combined with Homotopy

Classification: TN911.73 [Electronics and Telecommunications: Communication and Information Systems; Electronics and Telecommunications: Information and Communication Engineering]; O235 [Science: Operations Research and Cybernetics; Science: Mathematics]
Author affiliations: [1] School of Communication and Electronics, Jiangxi Science & Technology Normal University, Nanchang 330031, China; [2] College of Science and Technology, Nanchang University, Nanchang 330029, China; [3] Department of Electronic Information Engineering, Nanchang University, Nanchang 330031, China
Funding: National Natural Science Foundation of China (No. 61210306074); Natural Science Foundation of Jiangxi Province, China (No. 2012BAB201025); the Scientific Program of Jiangxi Provincial Education Department, China (Nos. GJJ14583, GJJ13008)

Abstract:

Recognizing human mouth-states is often necessary for detecting the number of audio sources and for improving the speech recognition capability of an intelligent robot auditory system. A human mouth-state recognition method based on image warping and sparse representation (SR) combined with homotopy is proposed. Using properly warped training mouth-state images as atoms of the overcomplete dictionary overcomes the impact of the diversity of the mouths' scales, shapes and positions, so that robustness is further improved and the requirement for a large number of training samples is relieved. The homotopy method is employed to compute the expansion coefficients efficiently, i.e., for sparse coding. Orthogonal matching pursuit (OMP) is also tested and compared with the homotopy method. Experimental results and comparisons with state-of-the-art methods demonstrate the effectiveness of the proposed approach.
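
The classification idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses a plain NumPy orthogonal matching pursuit (the comparison method named in the abstract) rather than homotopy, and it replaces warped mouth-state images with synthetic vectors. The function names `omp`, `classify` and `toy_dictionary`, the two-class setup, and all sizes are assumptions made for illustration only.

```python
import numpy as np


def omp(D, y, n_nonzero):
    """Greedy orthogonal matching pursuit: code y with at most n_nonzero atoms of D.

    Assumes n_nonzero >= 1 and unit-norm dictionary columns.
    """
    residual = y.copy()
    support = []
    x = np.zeros(D.shape[1])
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0  # never reselect an atom
        support.append(int(np.argmax(correlations)))
        # Re-fit all selected atoms jointly by least squares.
        coeffs, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        x[:] = 0.0
        x[support] = coeffs
        residual = y - D[:, support] @ coeffs
        if np.linalg.norm(residual) <= 1e-10 * np.linalg.norm(y):
            break
    return x


def classify(D, labels, y, n_nonzero=5):
    """Assign y to the class whose atoms alone reconstruct it with the smallest residual."""
    x = omp(D, y, n_nonzero)
    classes = np.unique(labels)
    residuals = []
    for c in classes:
        x_c = np.where(labels == c, x, 0.0)  # keep only coefficients of class c
        residuals.append(np.linalg.norm(y - D @ x_c))
    return classes[int(np.argmin(residuals))]


def toy_dictionary(rng, dim=100, rank=3, atoms_per_class=10):
    """Hypothetical stand-in for vectorized, warped training images of two mouth states."""
    bases, atoms, labels = [], [], []
    for c in range(2):
        B = rng.standard_normal((dim, rank))  # low-dimensional class subspace
        A = B @ rng.standard_normal((rank, atoms_per_class))
        atoms.append(A / np.linalg.norm(A, axis=0))  # unit-norm atoms
        labels += [c] * atoms_per_class
        bases.append(B)
    return np.hstack(atoms), np.array(labels), bases


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    D, labels, bases = toy_dictionary(rng)
    y = bases[1] @ rng.standard_normal(3)  # synthetic test sample drawn from class 1
    print("predicted class:", classify(D, labels, y))
```

In this sketch each dictionary column plays the role of one training image, so the class decision depends only on which group of atoms explains the test vector best. A homotopy solver would replace `omp` while leaving `classify` unchanged.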

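Project associated with this journal paper

Blind Source Separation Based on Audio-Visual Information Fusion

Journal papers: 5; conference papers: 2

Journal papers from the same project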

Audio-visual underdetermined blind source separation algorithm based on Gaussian potential function

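Nonlinear image encryption algorithm based on random fractional Mellin transform

Optical image encryption algorithm based on random fractional Mellin transform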

Flexible multiple-image encryption algorithm based on log-polar transform and double random phase en