This paper proposes a scheme for comparing genes according to the arrangement of transcription factor binding sites (TFBS) in their upstream regions, and uses it to cluster DNA sequences and study the relationships between genes. TFBS information was obtained from the TRANSFAC database, binding sites were analyzed with position-specific scoring matrices, and the analysis was carried out in Matlab. The upstream sequences of a group of genes found to be co-regulated in a microarray experiment were compared and clustered, yielding a tree that reflects the relationships between the genes and from which genes with shared functional characteristics can be identified. The results indicate that, among the genes related to Kashin-Beck disease, the regulatory regions of CIDEA, CYP4V2, RHBDD3 and ENC1 share common sequence features; this is presumed to be why their expression patterns and regulatory mechanisms are the most similar. These findings provide a basis for deeper analysis of gene function.
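The pipeline summarized above (score upstream sequences with position-specific scoring matrices, summarize each gene by its TFBS content, then cluster the genes into a tree) can be illustrated with a minimal sketch. It is written in Python rather than the Matlab used in the study, and it is not the authors' implementation. The matrices, sequences, gene labels, and score threshold below are invented toy values, not TRANSFAC data. Describing each gene by per-factor hit counts, comparing genes by Euclidean distance, and merging clusters by average linkage are all simplifying assumptions; the study itself compares the arrangement of binding sites.

```python
import math
from itertools import combinations

# Hypothetical toy PSSMs (one score per base at each motif position).
# Real matrices would come from TRANSFAC; these values are invented.
def _column(best_base):
    return {b: (1.2 if b == best_base else -1.0) for b in "ACGT"}

TOY_PSSMS = {
    "TF_A": [_column(b) for b in "ACG"],  # toy motif favouring ACG
    "TF_B": [_column(b) for b in "TTA"],  # toy motif favouring TTA
}

# Hypothetical upstream sequences with placeholder gene labels.
TOY_SEQS = {
    "gene1": "ACGTTACGACG",
    "gene2": "ACGACGTTAAC",
    "gene3": "TTATTATTAGG",
}

def count_hits(seq, pssm, threshold):
    """Count windows whose summed PSSM score reaches the threshold."""
    width = len(pssm)
    hits = 0
    for i in range(len(seq) - width + 1):
        # Unknown bases (e.g. N) contribute a neutral score of 0.
        score = sum(pssm[j].get(seq[i + j], 0.0) for j in range(width))
        if score >= threshold:
            hits += 1
    return hits

def tfbs_profile(seq, pssms, threshold=3.0):
    """Describe a sequence by its hit count for each factor (sorted by name)."""
    return [count_hits(seq, pssms[name], threshold) for name in sorted(pssms)]

def euclidean(p, q):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def cluster(profiles):
    """Average-linkage agglomerative clustering; returns a nested-tuple tree."""
    # Each item pairs a subtree with the list of gene names it contains.
    items = [(name, [name]) for name in sorted(profiles)]
    while len(items) > 1:
        best = None
        for (i, (_, mi)), (j, (_, mj)) in combinations(enumerate(items), 2):
            total = sum(euclidean(profiles[a], profiles[b]) for a in mi for b in mj)
            dist = total / (len(mi) * len(mj))
            if best is None or dist < best[0]:
                best = (dist, i, j)
        _, i, j = best
        merged = ((items[i][0], items[j][0]), items[i][1] + items[j][1])
        items = [it for k, it in enumerate(items) if k not in (i, j)] + [merged]
    return items[0][0]

profiles = {gene: tfbs_profile(seq, TOY_PSSMS) for gene, seq in TOY_SEQS.items()}
tree = cluster(profiles)
print(profiles)
print(tree)
```

In this toy input, gene1 and gene2 have similar binding-site content and are joined first, with gene3 attached last. With real data, genes that group closely in such a tree would be candidates for shared regulation, which is the kind of relationship the abstract reports for CIDEA, CYP4V2, RHBDD3 and ENC1.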