东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

面向图像分类的核主成分分析网络

ISSN号：1003-7985
期刊名称：《东南大学学报：英文版》
时间：0
分类：TN911.72[电子电信—通信与信息系统;电子电信—信息与通信工程]
作者机构：[1]东南大学计算机网络和信息集成教育部重点实验室,南京210096, [2]法国国家医学与健康研究院U1099,雷恩35000, [3]雷恩一大信号与图像处理实验室,雷恩35000, [4]中法生物医学信息研究中心南京210096
相关基金：国家自然科学基金（61201344,61271312,61401085）,高等学校博士学科点专项科研基金（20120092120036）

作者：吴丹[1,4] 伍家松[1,2,3,4] 曾瑞[1,4] 姜龙玉[1,4] Lotfi Senhadji[2,3,4] 舒华忠[1,4]

关键词：信号处理, 深度学习, 卷积神经网络, 快速傅里叶变换, Signal processing, Deep learning, Convolutional Neural Network （CNN）, Fast Fourier Transform （FFT）

中文摘要：

卷积神经网络在语音识别和图像识别等众多领域取得了突破性进展，限制其大规模应用的很重要的一个因素就是其计算复杂度，尤其是其中空域线性卷积的计算。利用卷积定理在频域中实现空域线性卷积被认为是一种非常有效的实现方式，该文首先提出一种统一的基于时域抽取方法的分裂基-2/（2a） 1维FFT快速算法，其中a为任意自然数，然后在CPU环境下对提出的FFT算法在一类卷积神经网络中的加速性能进行了比较研究。在MNIST手写数字数据库以及Cifar-10对象识别数据集上的实验表明：利用分裂基-2/4 FFT算法和基-2 FFT算法实现的卷积神经网络相比于空域直接实现的卷积神经网络，精度并不会有损失，并且分裂基-2/4能取得最好的提速效果，在以上两个数据集上分别提速38.56%和72.01%。因此，在频域中实现卷积神经网络的线性卷积操作是一种十分有效的实现方式。

英文摘要：

Convolution Neural Networks （CNN） make breakthrough progress in many areas recently, such as speech recognition and image recognition. A limiting factor for use of CNN in large-scale application is, until recently, their computational expense, especially the calculation of linear convolution in spatial domain. Convolution theorem provides a very effective way to implement a linear convolution in spatial domain by multiplication in frequency domain. This paper proposes an unified one-dimensional FFT algorithm based on decimation-in-time split- radix-2/（2a）, in which a is an arbitrary natural number. The acceleration performance of convolutional neural network is studied by using the proposed FFT algorithm on CPU environment. Experimental results on the MNIST database and Cifar-10 database show great improvement when compared to the direct linear convolution based CNN with no loss in accuracy, and the radix-2/4 FFT gets the best time savings of 38.56% and 72.01% respectively. Therefore, it is a very effective way to realize linear convolution operation in frequency domain.

同期刊论文项目