东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

模型的固有复杂度和泛化能力与几何曲率的关系

ISSN号：0254-4164
期刊名称：《计算机学报》
时间：0
分类：TP391[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]北京交通大学计算机与信息技术学院,北京100044
相关基金：本课题得到国家自然科学基金（60373029）、教育部博士点基金（20050004001）和北京市重点学科共建项目基金资助.

关键词：模型选择, 泛化能力, 固有复杂度, 统计流形, Gauss-Kroneker曲率, model selection, generalizability, intrinsic complexity, statistical manifold, GaussKroneker curvature

中文摘要：

从微分几何角度考察与参数化形式无关的统计模型流形的固有复杂度,指出模型流形的Gauss-Kroneker曲率可以完全刻画模型流形在一点处的全部性质,进而分析了曲率与体积的关系;给出了基于参数估计量邻域附近的解轨迹方法的曲率计算方法;证明了用于衡量泛化能力的未来残差可以用模型的曲率来表示,由此给出一种新的以曲率度量模型复杂度的模型选择准则GKCIC;对几何方法和统计学习理论进行了分析比较.在人工数据集和真实数据集上的比较实验结果表明了文中提出的方法的有效性.

英文摘要：

The paper uses the conception of curvature from the point of view of differential geometry to explore the intrinsic model complexity that is free of reparametrization; and then through theoretical analysis, shows that the Gauss-Kroneker curvature can describe the whole properties of the statistical manifold, thus gives the relation between curvature and the volume of the manifold. An algorithm is proposed based on study of the solution locus in the neighborhood of the expectation of parameters to calculate the curvature of the model. This paper proves that the future residual that is qualified to measure the generalizability can be expressed by using the intrinsic curvature array of model, from which a new model selection criterion GKCIC is given. It not only considers the factors such as the number of parameters, sample size and functional form, but also with very clear and intuitive geometric understanding of model selection. The geometrical method of the statistical manifold is compared with the statistical learning theory, in particular, the VC dimension versus the Gauss-Kroneker curvature. By running the algorithm on synthetic and real datasets, the author argue that the GKCIC work efficiently.

同期刊论文项目

基于人类视觉感知系统的有效编码模型

期刊论文 27 会议论文 43 著作 1

同项目期刊论文

基于模型的层次化强化学习算法

主曲线构建算法研究

修剪算法的信息几何分析

退火期望最大化算法A-EM

人工神经网络知识增殖性分析

基于最大判别熵的有监督独立分量

基于决策树的神经网络

实现人工神经网络知识增殖能力的

A Visual Perceptual Grouping A

基于谱图理论的流形学习算法

Samples Selection in Semi-Supe

基于注意机制的稀疏编码模型

高维数据流形的低维嵌入及嵌入维

基于图像统计特性的格式塔规则量

利用多尺度分析和编组的基于目标

信息理论框架下的神经网络构建

基于视觉系统“what”和“where

基于测地线距离的广义高斯型Laplacian特征映射

基于视觉系统“What”和“Where”通路的图像显著区域检测

动态增殖流形学习算法

基于非线性维数缩减的复杂网络聚类可视化

全局显著结构主导下的知觉编组算法

利用多尺度分析和编组的基于目标的注意计算模型

基于what和where信息的目标检测方法

期刊信息

《计算机学报》
北大核心期刊（2011版）

主管单位:中国科学院
主办单位:中国计算机学会中国科学院计算技术研究所
主编：孙凝晖
地址：北京中关村科学院南路6号
邮编：100190
邮箱：cjc@ict.ac.cn
电话：010-62620695

国际标准刊号：ISSN：0254-4164
国内统一刊号：ISSN：11-1826/TP
邮发代号:2-833

获奖情况:
中国期刊方阵“双效”期刊

国内外数据库收录:
美国数学评论（网络版）,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:48433