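A bioinformatics analysis platform was established for constructing tissue-specific electronic gene expression profiles from electronic ESTs in the expressed sequence tag (EST) database (dbEST) of the US National Center for Biotechnology Information (NCBI), and the platform was used to construct an electronic gene expression profile of normal human colorectal tissue. A total of 20 370 electronic ESTs from normal human colorectum were retrieved from dbEST and matched against a locally downloaded copy of the human UniGene database using the in-house GetUni software package, yielding an electronic gene expression profile of normal colorectal tissue containing 4 196 non-redundant genes. Analysis with the online genome analysis tool Webgestalt confirmed that 97% of the genes in the profile are expressed in colorectal tissue; in addition to genes involved in cell growth, development, differentiation, and apoptosis, the profile includes genes specific to normal human colorectal function. The 3% of genes not confirmed by Webgestalt were traced back manually, which confirmed that their ESTs all originated from normal human colorectum. The GetUni package is an efficient and accurate high-throughput platform for electronic EST data analysis, and the normal human colorectal gene expression profile constructed here will provide a large body of data for screening colorectum-specific markers.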
A bioinformatics analysis platform for constructing tissue-specific gene expression profiles from the expressed sequence tag (EST) database (dbEST) of the National Center for Biotechnology Information (NCBI) was established. Using this platform, an expression profile of normal human colorectum was obtained. A total of 20 370 normal colorectal ESTs were downloaded from dbEST, and an expression profile comprising 4 196 non-redundant genes was obtained with the in-house GetUni software package. Of the genes in the profile, 97% were confirmed to be expressed in normal human colorectum by analysis with the WEB-based Gene Set Analysis Toolkit (Webgestalt). The profile contains genes related to growth, development, proliferation, and apoptosis, as well as genes specific to colorectal function. The remaining 3% of genes were also confirmed to be expressed in normal colorectum by manual search of dbEST. The GetUni software package is therefore a reliable and powerful tool for building specific gene expression profiles from electronic EST analysis. The colorectal expression profile reported here should be valuable for identifying colorectum-specific molecular markers.
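The core step described above, matching ESTs to UniGene clusters and collapsing them into a non-redundant gene list, can be illustrated with a minimal sketch. This is not the GetUni implementation, whose internals are not described here. The function name `build_expression_profile`, the in-memory dictionary standing in for a local UniGene copy, and all accession and cluster identifiers are hypothetical placeholders chosen for illustration.

```python
from collections import Counter


def build_expression_profile(est_accessions, est_to_cluster):
    """Collapse ESTs into a non-redundant profile (hypothetical helper).

    est_accessions: iterable of EST accession strings for one tissue.
    est_to_cluster: dict mapping EST accession -> UniGene cluster ID
                    (assumed to be built from a local UniGene copy).
    Returns (profile, unmatched): a Counter of ESTs per cluster and a
    list of accessions that matched no cluster.
    """
    profile = Counter()
    unmatched = []
    for acc in est_accessions:
        cluster = est_to_cluster.get(acc)
        if cluster is None:
            # Kept for manual follow-up rather than silently dropped
            unmatched.append(acc)
        else:
            profile[cluster] += 1
    return profile, unmatched


# Toy, made-up identifiers for illustration only (not real dbEST data)
est_to_cluster = {"EST0001": "Hs.1", "EST0002": "Hs.1", "EST0003": "Hs.2"}
ests = ["EST0001", "EST0002", "EST0003", "EST0004"]

profile, unmatched = build_expression_profile(ests, est_to_cluster)
print("non-redundant clusters:", len(profile))
print("ESTs per cluster:", profile.most_common())
print("unmatched ESTs:", unmatched)
```

In this sketch the number of keys in `profile` plays the role of the non-redundant gene count, and the per-cluster EST counts give a rough electronic measure of expression. Unmatched accessions are returned separately so they can be checked by hand.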