尝试将一种基于图的Fiedler向量的聚类算法引入到基因表达谱数据的肿瘤分类中来。该方法将分属不同类的所有样本通过高斯权构造Laplace完全图,经SVD分解后获得Fiedler向量,利用各样本所对应的Fiedler向量分量的符号差异来进行基因表达谱数据的分类。通过模拟数据仿真实验和对白血病两个亚型(ALL与AML)及结肠癌真实数据实验,证明了这一方法的有效性。
An algorithm for classification of gene expression data based on Fiedler Vector was proposed.Firstly,the Laplacian matrix of complete graph is constructed on all the different types of gene expression data.Then,the Fiedler Vector is obtained by the singular value decomposition of this Laplacian matrix.Finally,the samples are divided into two classes by utilizing the signs of the Fiedler Vector components.The effectiveness of this algorithm has been proven by simulation experiment and real data experiment.