稀疏编码算法是一种常用的图像数据表示方法.为了处理高度非线性分布的数据,文中提出了一种核稀疏概念编码算法,并应用于图像表示.该算法首先对邻域图进行谱分析,提取数据的几何流形结构信息;然后将原始特征空间数据映射到高维特征空间中,利用谱回归在高维特征空间中来计算基向量;最后在高维特征空间中对每个样本逐个进行表示.文中算法不仅能有效地处理非线性结构数据,而且只需求解一个稀疏特征值问题和两个回归问题,计算简单有效.在Yale、ORL和PIE图像库的聚类实验表明,文中算法的准确率和归一化互信息均优于其它几种对比算法.
Sparse coding algorithm is a popular data representation method. In order to deal with the high nonlinear data, in this paper, a kernel sparse concept coding (KSCC) algorithm is proposed for image representation. Our algorithm performs spectral analysis on nearest neighbor graph and captures the geometric manifold structure of the data. Then the data in the origin feature space is mapped into the high-dimensional feature space and the basis vector in high-dimensional space is obtained using spectral regression. Finally, the samples are individually represented in high-dimensional feature space. Therefore, the proposed algorithm not only effectively handles the nonlinear structure data, but also needs to solve a sparse eigen-problem and two regression problems, which is very simple and effective. The experiments on Yale, ORL and PIE image datasets demonstrate that the accuracy and normalized mutual information of our proposed algorithm are superior to other comparison algorithms.