应用DNA微阵列数据对癌症进行诊断与分型,已经逐渐成为生物信息学领域的研究热点之一。首先概述了基于微阵列数据的癌症分类问题的研究现状与发展趋势。然后简要介绍了微阵列实验的基本步骤,微阵列数据的结构、特点以及用于癌症分类的基本流程。接下来重点从数据预处理、特征基因选择、分类器设计以及分类性能评价等几方面对近10年来的研究成果进行了详细的综述与比较分析。最后,对该领域目前仍然存在的问题进行了归纳并对未来可能的研究方向作出了预测与展望。
Applying DNA microarray data to diagnose for cancer and recognize different subtypes of the same tumor has been becoming one of hot topics in Bioinformatics.Firstly,this paper summarized the state of the art and the development trend for cancer classification based on microarray data.Then the basic procedure of DNA microarray experiments,structure characteristics of microarray data and general process for cancer classification based on DNA microarray data were introduced.After that,a detailed survey and systemic comparative analysis combined with the research results for the last ten years was made from several main aspects listed as below:data preprocessing,feature gene selection,classifier design classification performance evaluation.Finally,some subsistent difficulties in this research field were summarized and meanwhile,the possible directions for future work were also predicted and suggested.