A new method for expanding biological pathways based on protein-protein interaction knowledge
  • ISSN: 0253-9772
  • Journal: 遗传 (Hereditas (Beijing))
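  • Classification: Q-4 [Biology]
  • Author affiliations: [1] Institute of Medical Systems Biology and School of Public Health, Guangdong Medical College, Dongguan 523808; [2] School of Public Health, Sun Yat-sen University, Guangzhou 510080; [3] Maoming People's Hospital, Maoming 525000
  • Funding: supported by the National Natural Science Foundation of China (Nos. 31071166, 81373085), the Guangdong Province Science and Technology Planning Key Project (No. 2009A030301004), the Dongguan Key Science and Technology Project (No. 201108101015), and Guangdong Medical College funds (Nos. XG1001, JB1214, XZ1105, STIF201122, M2011024, M2011010)
  • Related project: Research on genome-wide genetic pathway analysis methods for complex diseases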
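Chinese abstract (translated):

Biological pathways are widely used in gene function studies, but current pathway knowledge is incomplete and needs further expansion. Bioinformatics prediction offers an effective and economical route to pathway expansion. This paper proposes a new method for gene pathway prediction that integrates protein-protein interaction knowledge with information from the Gene Ontology (GO) database. First, the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways that contain the target gene's neighbours at the protein-protein interaction level are selected as candidate pathways. Then the pathway membership of the target gene is decided by testing whether the genes in each candidate pathway are enriched in the GO nodes associated with the target gene. Predictions were made separately with protein-protein interaction data from the Human Protein Reference Database (HPRD) and the Biological General Repository for Interaction Datasets (BioGRID). In both datasets, as the number of interacting neighbours increased, both the average accuracy (the proportion of all pathways annotated to the target genes that were successfully predicted) and the relative accuracy (among genes with at least one annotated pathway successfully predicted, the proportion of genes whose annotated pathways were all predicted correctly) showed an upward trend. When the number of interacting neighbours reached 22, the average accuracy reached 96.2% (HPRD) and 96.3% (BioGRID), and the relative accuracy was 93.3% (HPRD) and 84.1% (BioGRID). A further validation used a new database release to check 89 genes whose annotations had been updated relative to the old release: 50 genes had at least one updated pathway predicted correctly, and for 43 of these genes all updated pathways were predicted correctly, a relative accuracy of 86.0%. These results indicate that the method is a reliable and effective approach to pathway expansion.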
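The two-step procedure in the abstract (candidate pathways from interaction neighbours, then a GO enrichment test) can be illustrated with a minimal Python sketch. The abstract does not say which enrichment statistic, significance threshold, or multiple-testing correction the authors used, so the one-sided hypergeometric test, the `alpha=0.05` cutoff, all function names, and the toy gene, pathway, and GO identifiers below are assumptions made purely for illustration.

```python
from math import comb


def hypergeom_sf(k, M, n, N):
    """P(X >= k) when N genes are drawn from M genes, n of which carry the GO term."""
    upper = min(n, N)
    return sum(comb(n, i) * comb(M - n, N - i) for i in range(k, upper + 1)) / comb(M, N)


def predict_pathways(target, ppi, pathways, gene_to_go, go_to_genes, universe, alpha=0.05):
    """Return the candidate pathways that pass the (assumed) enrichment test."""
    background = universe - {target}  # the target itself is excluded from the test
    neighbours = ppi.get(target, set())
    # Step 1: candidate pathways are those containing at least one PPI neighbour.
    candidates = [name for name, genes in pathways.items() if genes & neighbours]
    predicted = set()
    # Step 2: keep a candidate if its genes are enriched in any GO term of the target.
    for name in candidates:
        members = pathways[name] & background
        for term in gene_to_go.get(target, set()):
            annotated = go_to_genes[term] & background
            overlap = len(members & annotated)
            p = hypergeom_sf(overlap, len(background), len(annotated), len(members))
            if p < alpha:
                predicted.add(name)
                break
    return predicted


# Hypothetical toy data: 20 background genes plus the target gene "T".
universe = {f"g{i}" for i in range(1, 21)} | {"T"}
ppi = {"T": {"g1", "g2"}}
pathways = {"P1": {"g1", "g3", "g4", "g5"}, "P2": {"g2", "g10", "g11", "g12"}}
gene_to_go = {"T": {"GO:A"}}
go_to_genes = {"GO:A": {"T", "g1", "g3", "g4", "g5", "g6"}}

result = predict_pathways("T", ppi, pathways, gene_to_go, go_to_genes, universe)
print(result)
```

In this toy example both pathways become candidates because each contains one neighbour of `T`, but only `P1` shares genes with the GO term of `T`, so only `P1` passes the test.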

English abstract:

Biological pathways have been widely used in gene function studies; however, current knowledge of biological pathways is incomplete and needs to be further expanded. Bioinformatics prediction provides an inexpensive but effective way to expand pathways. Here, we propose a novel method for biological pathway prediction that integrates prior knowledge of protein-protein interactions with the Gene Ontology (GO) database. First, the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways to which the interacting neighbours of a target gene (at the level of protein-protein interaction) belong are chosen as the candidate pathways. Then, the pathways to which the target gene belongs are determined by testing whether the genes in the candidate pathways are enriched in the GO terms to which the target gene is annotated. Protein-protein interaction data from the Human Protein Reference Database (HPRD) and the Biological General Repository for Interaction Datasets (BioGRID) were each used to predict the pathway attribution(s) of the target gene. The results demonstrated that both the average accuracy (the ratio of correctly predicted pathways to the total number of pathways to which all the target genes were annotated) and the relative accuracy (among the genes with at least one annotated pathway successfully predicted, the percentage of genes with all annotated pathways correctly predicted) increased with the number of interacting neighbours. When the number of interacting neighbours reached 22, the average accuracy was 96.2% (HPRD) and 96.3% (BioGRID), and the relative accuracy was 93.3% (HPRD) and 84.1% (BioGRID). Further validation analysis of 89 genes whose pathway knowledge was updated in a new database release indicated that 50 genes were correctly predicted for at least one updated pathway, and 43 genes were accurately predicted for all the updated pathways, giving an estimated relative accuracy of 86.0%.
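The two accuracy measures defined in the abstract can be made concrete with a short Python sketch. The function names and the toy annotations are hypothetical, and reading "all annotated pathways correctly predicted" as "the annotated set is a subset of the predicted set" is an assumption. The last line reproduces the validation arithmetic reported in the abstract (43 of 50 genes).

```python
def average_accuracy(annotated, predicted):
    """Share of all annotated (gene, pathway) pairs that were predicted."""
    total = sum(len(paths) for paths in annotated.values())
    hits = sum(len(paths & predicted.get(gene, set())) for gene, paths in annotated.items())
    return hits / total


def relative_accuracy(annotated, predicted):
    """Among genes with at least one hit, the share whose annotated pathways were all predicted."""
    partly = [g for g, paths in annotated.items() if paths & predicted.get(g, set())]
    fully = [g for g in partly if annotated[g] <= predicted.get(g, set())]
    return len(fully) / len(partly)


# Hypothetical toy data: gene A is half right, gene B fully right, gene C missed.
annotated = {"A": {"p1", "p2"}, "B": {"p3"}, "C": {"p4"}}
predicted = {"A": {"p1"}, "B": {"p3", "p9"}, "C": set()}

avg = average_accuracy(annotated, predicted)   # 2 hits out of 4 annotated pathways
rel = relative_accuracy(annotated, predicted)  # 1 fully correct gene out of 2 with a hit
validation_rel = round(100 * 43 / 50, 1)       # validation set from the abstract: 86.0
print(avg, rel, validation_rel)
```

Both functions assume non-empty input; a production version would need to handle the case where no gene has any correctly predicted pathway.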

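Journal information
  • 《遗传》 (Hereditas (Beijing))
  • China Science and Technology Core Journal
  • Supervising body: Chinese Academy of Sciences
  • Sponsor: Genetics Society of China
  • Editor-in-chief: Zhang Yongqing (张永清)
  • Address: Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, No. 1 Beichen West Road, Chaoyang District, Beijing
  • Postal code: 100101
  • Email: yczz@genetics.ac.cn
  • Telephone: 010-64807669
  • International standard serial number: ISSN 0253-9772
  • Domestic unified serial number: CN 11-1913/R
  • Postal distribution code: 2-810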
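  • Awards:
  • China Natural Science Core Journal; 《CAJ-CD》 Excellence in Implementation Award; awarded the "China Outstanding Science and Technology Journal" certificate in December 2008, and Beijing municipal ...
  • Indexed in domestic and international databases:
  • Chemical Abstracts (web edition, USA); abstracts of the Centre for Agriculture and Bioscience International (UK); abstract and citation database (Netherlands); biomedical retrieval system (USA); biological sciences database (USA); Japan Science and Technology Agency database (Japan); China Science and Technology Core Journals (China); Peking University Core Journals (China; 2000, 2004, 2008, 2011, and 2014 editions)
  • Citations: 23270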