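Using the RefSeq database and sequenced genome sequences as templates, we built, through large-scale computation, a "standard transcription database" that represents information at each level of transcription, and used Common Gateway Interface (CGI) technology to establish a web service system for standard transcript datasets of human and model organisms. Users can submit a RefSeq accession number or free-text annotation terms to retrieve all available information on a sequence and to run gene structure parsing online. The system currently covers six species (human, Arabidopsis, rice, rat, mouse, and zebrafish) and holds more than 180,000 data records. It provides an important tool for in-depth study of the transcriptomes of human and other species, and a solid data foundation for further analysis of alternative splicing patterns in eukaryotic genes.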
Using the RefSeq database and reference genomic sequences as templates, we built a standard transcription database named "StdTransDb" through large-scale computational analysis. A web server that supports sequence search and online transcript computation with different algorithms has been implemented using CGI technology. Users can provide a RefSeq accession number or annotation terms to retrieve related information and obtain gene structure parsing results online. The system includes over 180,000 records from six species: Homo sapiens, Arabidopsis thaliana, Oryza sativa, Mus musculus, Rattus norvegicus, and Danio rerio. It provides an important tool for transcriptome research in human and other species, and contributes to the analysis of alternative splicing in eukaryotic genes.