东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

一种新的有监督流形学习方法

期刊名称：计算机研究与发展. 44(12)(2007), 2072-2077
时间：0
分类：TP391.4[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]西安交通大学信息与系统科学研究所,西安710049, [2]西安交通大学电子与信息工程学院,西安710049
相关基金：国家自然科学基金项目（70531030,60575045） Manifold learning methods for nonlinear dimensionality reduction have attracted more and more attentions in the recent decade due to their excellent performance especially on unsupervised learning and data visualization. However, these methods still can＇t be applied to supervised learning problem very efficiently. In this paper, a new supervised manifold learning method is proposed by integrating SVM and manifold learning methods. Because of the prominent properties of both adopted methods, the new method has excellent ability to deal with the supervised learning problem. Our work is supported by the projects of the National Natural Science Foundation of China （70531030 and 60575045）.
相关项目：关于支撑向量回归机的模型选择问题

关键词：流形学习方法, 支撑向量机, 等距特征映射, 局部线性嵌入, 分类, manifold learning, support vector machine, Isomap, LLE, classification

中文摘要：

提出了一种新的有监督流形学习方法，目的是提供将流形学习降维方法高效应用于有监督学习问题的全新策略．算法的核心思想是集成流形学习方法对高维流形结构数据的降维有效性与支撑向量机（SVM）在中小规模分类数据集上的优良特性实现高效有监督流形学习．算法具体实现步骤为：首先利用SVM在流形学习降维数据中选出对分类决策最重要的数据集，即支撑向量集；按标号返回可得到原空间的支撑向量集；在这个集合上再次使用SVM即可得到原空间的分类决策，从而完成有监督流形学习．在一系列人工与实际数据集上的实验验证了方法的有效性．

英文摘要：

A new supervised manifold learning method is proposed in this paper, in order to present a new strategy to efficiently apply manifold learning and nonlinear dimensionality reduction methods to supervised learning problems. The new method realizes efficient supervised learning mainly based on integrating the topology preserving property of the manifold learning methods （Isomap and LLE） and some prominent properties of support vector machine such as efficiency on middle and small sized data sets and essential capability of support vectors calculated from support vector machine. The method is realized via the following steps： first to apply Isomap or LLE to get the embeddings of the original data set in the low dimensional space; then to obtain support vectors, which are the most significant and intrinsic data for the final classification result, by using support vector machine on these low dimensional embedding data; subsequently to get support vectors in the original high dimensional space based on the corresponding labels of the obtained low dimensional support vectors; finally to apply support vector machine again on these high dimensional support vectors to gain the final classification discriminant function. The good performance of the new method on a series of synthetic and real world data sets verifies the feasibility and efficiency of the method.

同期刊论文项目