针对无监督属性选择算法无类别信息和未考虑属性的低秩问题,提出一种基于自表达方法的低秩属性选择算法。在损失函数中使用低秩和自表达方法描述属性间的相关结构,利用K均值聚类算法得到所有样本的伪类标签进行属性选择,采用稀疏学习方法中的l2,p-范数参数p控制属性选择结果的稀疏性,并通过子空间学习方法使属性选择结果达到全局最优。实验结果表明,与无监督属性选择算法相比,该算法在6个公开数据集上均具有较高的分类准确率及稳定性。
Since unsupervised feature selection algorithms do not have label information and also ignore the low-rank characteristics of the data,this paper proposes a new low-rank feature selection algorithm based on self-representation method. In the loss function,low rank and self-representation methods are used to describe the correlation structure between features, and the K-means clustering method is used to obtain the pseudo labels of samples to realize feature selection. Then,l2,p-norm parameter p in sparse learning method is adopted to control the sparsity of feature selection results. Through subspace learning method,the result of feature selection is globally optimal. The experimental results on six public datasets demonstrate that the proposed feature selection algorithm has higher classification accuracy and better stability compared with the unsupervised feature selection algorithm.