When differentially expressed (DE) genes and functions associated with breast cancer metastasis are identified from microarray gene expression profiles, the results reported by different studies tend to show low reproducibility, because gene expression varies considerably among individuals and most experiments use relatively few samples. In this study, we evaluated the reproducibility of the DE genes, and of the functions enriched with them, separately extracted from two microarray datasets for breast cancer metastasis. The results showed that the DE genes identified in the two datasets were highly consistent in their direction of expression change and were significantly correlated in expression. Moreover, the metastasis-associated functions identified from the two DE gene lists were highly reproducible across the two datasets and were mainly associated with 'cell division', 'cell cycle', 'DNA replication', 'chromosome segregation', 'phosphoinositide-mediated signaling' and 'response to DNA damage stimulus'.
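The two gene-level comparisons described above (consistency in the direction of expression change, and correlation of expression changes) can be illustrated with a minimal sketch. The abstract does not state which statistics were used, so everything here is an assumption for illustration: the function names `direction_concordance` and `pearson`, the use of log fold changes as the measure of expression change, the choice of Pearson correlation, and the made-up gene names and values.

```python
from math import sqrt

def direction_concordance(fc_a, fc_b):
    """Return (fraction, n): the fraction of shared genes whose log fold
    changes have the same sign in both datasets, and the number of shared
    genes compared. Genes with a zero fold change are skipped."""
    shared = [g for g in fc_a
              if g in fc_b and fc_a[g] != 0 and fc_b[g] != 0]
    if not shared:
        raise ValueError("no shared genes with non-zero fold change")
    same = sum(1 for g in shared if (fc_a[g] > 0) == (fc_b[g] > 0))
    return same / len(shared), len(shared)

def pearson(fc_a, fc_b):
    """Pearson correlation of log fold changes over the shared genes."""
    shared = sorted(set(fc_a) & set(fc_b))
    n = len(shared)
    if n < 2:
        raise ValueError("need at least two shared genes")
    xs = [fc_a[g] for g in shared]
    ys = [fc_b[g] for g in shared]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("fold changes are constant in one dataset")
    return cov / sqrt(var_x * var_y)

# Hypothetical log fold changes (metastatic vs. non-metastatic) for DE genes
# from two datasets; gene names and values are invented for illustration.
dataset_a = {"GENE1": 1.2, "GENE2": -0.8, "GENE3": 0.5, "GENE4": -1.1}
dataset_b = {"GENE1": 0.9, "GENE2": -0.4, "GENE3": -0.2, "GENE5": 0.7}

fraction, n_shared = direction_concordance(dataset_a, dataset_b)
r = pearson(dataset_a, dataset_b)
print(f"shared genes: {n_shared}, same direction: {fraction:.2f}, r = {r:.2f}")
```

In a real analysis the concordance fraction would normally be tested against the 50% expected by chance (for example with a binomial test), and the correlation would be reported with a p-value; both steps are omitted here to keep the sketch dependency-free.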