对于时间序列的基因表达数据,传统的聚类算法都是以距离为相似性度量标准,没有考虑基因随时间变化的相似趋势。从基因变化的趋势出发,构造了一种新的模糊相似关系矩阵,提出了改进的基于模糊相似关系的聚类算法,并以该算法计算FCM的初始聚类中心。将该方法应用在酵母菌基因表达数据中,实验结果表明该算法不仅克服了FCM算法易陷入局部极小值、对初值敏感的缺点,而且能够发现一些表达模式变化趋势相似的共调控基因。
For time series gene expression data,the similarity measure of traditional clustering algorithm is measured based on distance.There is no consideration the coherent trend of expression patterns gene exhibit with time process.A new fuzzy similar relation matrix is constructed and a modified clustering algorithm based on fuzzy similarity relation is proposed.On this base,a new method is used to find the initial center of FCM algorithm.The method is used in yeast gene expression data.Experimental results show that the method not only overcomes the limitation of FCM algorithm,but also identifies cell-cycle regulated genes where expression levels change periodically during the cell cycle.