A New Method for Predicting Translation Initiation Sites of Prokaryotic Genes

ISSN: 1000-3282
Journal: Progress in Biochemistry and Biophysics (《生物化学与生物物理进展》)
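Classification: Q753 [Biology: Molecular Biology]; TS261.11 [Light Industry Technology and Engineering: Fermentation Engineering; Light Industry Technology and Engineering: Food Science and Engineering]
Author affiliations: [1] Department of Biomedical Engineering and State Key Laboratory for Turbulence and Complex Systems, Peking University, Beijing 100871, China; [2] Center for Theoretical Biology, Peking University, Beijing 100871, China; [3] Department of Mathematics, UCLA, Los Angeles, CA 90095, USA; [4] The Finnish Genome Center, University of Helsinki, 00290 Helsinki, Finland
Funding: Supported by the National Natural Science Foundation of China (30770499, 30300071, 10721403) and the National Basic Research Program of China (2003CB715905). We thank YAO Xin-Qiu, KANG Hong and SUN Zong-Xiao of Peking University, Philippe Ortet of CEA Cadarache, and Prof. WANG Jin of Nanjing University for beneficial discussions. We also thank Prof. ZHANG Chun-Ting for providing the GS-Finder program, and Dr. Yuko Makita and Dr. Michalis Aivaliotis for providing parts of the data sets. We are very grateful to Dr. Iain C Bruce of Zhejiang University School of Medicine for carefully reading and revising the manuscript.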

Authors: HU Gang-Qing (胡钢清)[1,2], LIU Yong-Chu (刘永初)[1,2], ZHENG Xiao-Bin (郑晓斌)[1,2], YANG Yi-Fan (杨一帆)[1,2], SHE Zhen-Su (佘振苏)[1,2,3], ZHU Huai-Qiu (朱怀球)[1,2,4]

Keywords: prokaryote, gene prediction, translation initiation site, prediction evaluation

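Chinese abstract (translated):

Accurately locating the translation initiation site (TIS, i.e., the 5' end of a gene) is a key problem in prokaryotic gene prediction, and the diversity of genomic GC content and of translation initiation mechanisms is a major factor limiting current TIS prediction performance. By combining complex information on genome structure (including GC content, the sequences adjacent to the TIS and upstream regulatory signals, sequence coding potential, and operon structure), we develop a mathematical-statistical model that characterizes translation initiation mechanisms and use it to design a new TIS prediction algorithm, MED-StartPlus. MED-StartPlus is systematically compared and evaluated against similar methods: RBSfinder, GS-Finder, MED-Start, TiCo, and Hon-yaku. The tests use two kinds of data sets: 14 currently known gene data sets with confirmed TISs, and gene data sets with known function from 300 species. The results show that the prediction accuracy of MED-StartPlus exceeds that of similar methods overall, with a clear advantage for high-GC genomes and for genomes with complex translation initiation mechanisms.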

English abstract:

Accurate prediction of the translation initiation site (TIS) is an important issue in prokaryotic genome annotation. However, existing methods still struggle to predict TISs in genomes spanning a wide range of GC content. Moreover, existing methods have not yet undergone a comprehensive evaluation, leaving prediction reliability a largely open problem. This paper presents MED-StartPlus, a new algorithm that predicts TISs in prokaryotic genomes across a wide range of GC content. It models the nucleotide composition bias, the regulatory motifs upstream of the TIS, the sequence patterns around the TIS, and the operon structure. Tests on hundreds of reliable data sets, with TISs confirmed by experiments or genes having annotated functions, show that the new method achieves high overall accuracy in TIS prediction. Compared with existing TIS predictors, the method performs better overall, especially for genomes that are GC-rich or have complex initiation mechanisms. A potential application of the method to improving the TIS annotations deposited in public databases is also proposed.
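
The abstract reports TIS prediction accuracy without stating how it is computed. As a rough illustration only, the sketch below assumes one common convention: a gene is identified by its strand and 3' end, and a prediction counts as correct when its start coordinate equals the reference start. The function name `tis_accuracy`, the dictionary layout, and the coordinates are all hypothetical and are not taken from the paper or from MED-StartPlus.

```python
from typing import Dict, Tuple

# Assumed representation: a gene is keyed by (strand, 3'-end coordinate),
# and the value is its start (TIS) coordinate. Not the paper's format.
GeneKey = Tuple[str, int]


def tis_accuracy(reference: Dict[GeneKey, int],
                 predicted: Dict[GeneKey, int]) -> float:
    """Fraction of shared genes whose predicted TIS equals the reference TIS.

    Only genes present in both sets (same strand and 3' end) are scored;
    predictions without a reference counterpart are ignored.
    """
    shared = [key for key in reference if key in predicted]
    if not shared:
        raise ValueError("no genes shared between reference and prediction")
    correct = sum(1 for key in shared if reference[key] == predicted[key])
    return correct / len(shared)


# Made-up coordinates, for illustration only.
reference = {("+", 1200): 301, ("+", 2500): 1450, ("-", 3100): 3990}
predicted = {("+", 1200): 301, ("+", 2500): 1420,
             ("-", 3100): 3990, ("+", 5000): 4100}

print(f"TIS accuracy: {tis_accuracy(reference, predicted):.3f}")
```

Keying genes by their 3' end keeps the comparison focused on the start coordinate alone, which is the quantity a TIS predictor is meant to refine.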

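Projects associated with this paper

Bioinformatics study of prokaryotic genome comparison based on translation initiation mechanisms

Journal papers: 18

Research on new algorithms for eukaryotic gene prediction

Journal papers: 7; conference papers: 1; books: 1

Research on biological networks

Journal papers: 72

Journal papers from the same projects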

MED: a new non-supervised gene

A New Method for Protein Secon

Splice site prediction method based on the sequence features of splicing signals and regulatory elements

A new method for splice site prediction based on the sequence patterns of splicing signals and regulatory elements

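Operon structure prediction based on iterative self-learning

Whole-genome analysis of pathogenic Escherichia coli UPEC CFT073 and new insights into its pathogenic mechanism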

Water-protein interplay reveals the specificity of alpha-lytic protease

ProTISA: a comprehensive resource for translation initiation site annotation in prokaryotic genomes

Computational evaluation of TIS annotation for prokaryotic genomes

New Solutions of Translation Initiation Site Prediction for Prokaryotic Genomes

Transient structures in protein folding and the slow-timescale dynamics of surface water molecules

Micro-pressure sensor made of conductive PDMS for microfluidic applications

Predicting cell cycle genes from E-MAP profiles by integrating multiple types of data

Prediction of Protein Functions from Protein-Protein Interaction Data Based on a New Measure of Netw

The maturation mechanism of SARS coronavirus 3C-like proteinase

Detecting multiple confounders

MetaTISA: Metagenomic translation initiation site annotator for improving gene start prediction

Prediction of translation initiation site for microbial genomes with TriTISA

Synthesizing a novel genetic sequential logic circuit: a push-on push-off switch

Protein-protein interactions: interface analysis, binding free energy calculation and interaction de

Intrinsically disordered proteins: the new sequence-structure-function relations

Nonparametric covariance model

Folding simulations of a de novo designed protein with βαβ fold

Local site preference rationalizes disentangling by DNA topoisomerases

The analysis of biases of copy numbers from Affymetrix SNP array

Water dynamics clue to key residues in protein folding

Smoothing molecular interactions: the “kinetic buffer” effect of intrinsically disordered proteins

Defining network topologies that can achieve biochemical adaptation

The why and how of DNA unlinking

Kinetic advantage of intrinsically disordered proteins in coupled folding–binding process: a critica

Partial orientation and local structural learning of causal networks for prediction

Bayesian networks and causal networks

Several statistical methods for causal mining

Genome reannotation of Escherichia coli CFT073 with new insights into virulence

De novo design of a βαβ motif

Hybridization modeling of oligonucleotide SNP arrays for accurate DNA copy number estimation

Partial orientation and local structural learning of DAGs for prediction

Acceleration of the EM and ECM algorithms using the Aitken δ² method for log-linear models with part

Finding Multiple-Target Optimal Intervention in Disease Related Molecular Network

Review and prospect of the project of applied statistical methods

Inward Propagating Chemical Waves in a Single-Phase Reaction-Diffusion System

Expanded flux variability analysis on metabolic network of Escherichia coli

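Homodimerization and substrate binding of SARS 3CL protease act as mutual allosteric regulators

End effects in the binding of proteins to linear DNA fragments

Protein-protein interactions: interface analysis, binding free energy calculation, and interaction design

A Vif-APO-related HIV-1 virus model

Simulation of the T-cell receptor activation process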

Fluctuation theorem for the mutation process in in vitro evolution

Modeling the intracellular dynamics for Vif-APO mediated HIV-1 virus infection

Journal information

Progress in Biochemistry and Biophysics (《生物化学与生物物理进展》)
Chinese Science and Technology Core Journal

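Supervising body: Chinese Academy of Sciences
Sponsors: Institute of Biophysics, Chinese Academy of Sciences; Biophysical Society of China
Editor-in-chief: WANG Da-Cheng (王大成)
Address: 15 Datun Road, Chaoyang District, Beijing
Postal code: 100101
Email: prog@sun5.ibp.ac.cn
Telephone: 010-64888459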

International standard serial number: ISSN 1000-3282
Domestic serial number: CN 11-2161/Q
Postal distribution code: 2-816

Awards:
China Periodical Award, nomination prize (1999); Chinese Academy of Sciences Special Award for Outstanding Journals (2000)

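Indexed in (domestic and international databases):
Chemical Abstracts, online edition (USA); abstract and citation database (Netherlands); Cambridge Scientific Abstracts (USA); Science Citation Index Expanded (USA); biological sciences database (USA); Japan Science and Technology Agency database (Japan); Chinese Science and Technology Core Journals (China); Peking University Core Journals (China; 2000, 2004, 2008, 2011, and 2014 editions)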

Times cited: 18821