位置:成果数据库 > 期刊 > 期刊详情页
互学习神经网络训练方法研究
  • ISSN号:0254-4164
  • 期刊名称:《计算机学报》
  • 时间:0
  • 分类:TP18[自动化与计算机技术—控制科学与工程;自动化与计算机技术—控制理论与控制工程]
  • 作者机构:[1]辽宁工程技术大学数学与系统科学研究所,辽宁阜新123000, [2]辽宁工程技术大学智能工程与数学研究院,辽宁阜新123000, [3]辽宁工程技术大学矿业学院,辽宁阜新123000
  • 相关基金:本课题得到国家自然科学基金(51304114,71371091)资助.
中文摘要:

由于BP神经网络具有表达能力强,模型简单等特点,经过近30年的发展,在理论和应用研究上都取得了巨大的进步,然而容易陷入局部最优和泛化能力差等问题却限制了神经网络的发展.同时,大数据的出现和深度学习算法的提出与应用,为神经网络向更类脑的方向发展提出了新的要求.针对上述问题,该文从模拟生物双向认知能力的角度出发,构造了一种新的神经网络模型——互学习神经网络模型,该模型在标准正向神经网络的基础上,引入了与其具有结构对称性的负向神经网络,利用正、负向神经网络分别模拟生物的顺向和逆向认知过程,并在此基础上提出了一种新的神经网络训练方法——互学习神经网络训练方法,该方法通过网络连接权值转置共享,正、负双向交替训练的方式对互学习神经网络模型进行训练,从而实现输入数据和输出标签之间的相互学习,使网络具有双向认知能力.实验表明,互学习神经网络训练方法可以同时训练正、负两个神经网络,并使网络收敛.同时,在此基础上提出了“互学习预训练+标准正向训练”的两阶段学习策略和相应的转换学习方法,这种转换学习方法起到了和“无监督预训练+监督微调”相同的效果,能够使网络训练效果更好,是一种快速、稳定、泛化能力强的新型神经网络学习方法.

英文摘要:

Since BP neural network is expressive and model is simple, there has been a great improvement in both theoretical and applied research over the past 30 years. But its development is held back due to model limitations on local optimism and overfitting. With the emergence of big data and application of deep learning, there are new requirements which gears neural network development towards more pseudo brain. To solve the above problem, this paper presents a new neural network model based on the simulation of biological bidirectional cognitive ability: the mutual learning neural network model. The design of the mutual learning neural network model originates from human being's bidirectional cognitive ability, that is, the forward cognitive ability and the backward cognitive ability, and the previous one possesses the cause and demands the result while conversely the latter one possesses the result and demands the cause. The mutual learning neural network model is composed of the positive neural network and the negative neural network. The positive neural network is a feedforward neural network with a hidden layer, which is used to set up the cognitive relationship from the cause(data)to the result(label)and simulate the forward cognitive ability. The negative neural network has the symmetrical relationship with the positive neural network, and it's mainly used to set up the cognitive relationship from the result(label)to the cause(data)and simulate the backward cognitive ability. These two neural networks are combined together by weight sharing and construct the neural network model together, which simulates human being's bidirectional cognitive procedure. Based on it, this paper proposes a new training method of neural network: mutual learning neural network training method. Firstly, the input of the mutual learning neural network trainirtg method is the data and output of it is the label, which trains the positive neural network through BP learning algorithm. After a certain times of trai

同期刊论文项目
同项目期刊论文
期刊信息
  • 《计算机学报》
  • 北大核心期刊(2011版)
  • 主管单位:中国科学院
  • 主办单位:中国计算机学会 中国科学院计算技术研究所
  • 主编:孙凝晖
  • 地址:北京中关村科学院南路6号
  • 邮编:100190
  • 邮箱:cjc@ict.ac.cn
  • 电话:010-62620695
  • 国际标准刊号:ISSN:0254-4164
  • 国内统一刊号:ISSN:11-1826/TP
  • 邮发代号:2-833
  • 获奖情况:
  • 中国期刊方阵“双效”期刊
  • 国内外数据库收录:
  • 美国数学评论(网络版),荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊(2004版),中国北大核心期刊(2008版),中国北大核心期刊(2011版),中国北大核心期刊(2014版),中国北大核心期刊(2000版)
  • 被引量:48433