随着高通量测序技术的迅速发展和食品微生物研究的逐步深入,产生了大量的数据和知识,且以不同的数据格式分布在各种数据库中。为了更好地支持食品微生物的相关研究,从各种分布式、异构的数据和知识中,进行数据提取与转换,并形成一个整合的数据平台显得尤为重要。FoodMicrobes数据库利用语义网技术,建立了一个食品微生物的整合型数据平台。该平台从各种开放的公共数据库,提取了与食品微生物相关的基因、基因组、基因功能、蛋白质序列与结构、代谢途径、文献、专利等信息,利用RDF的方法,对数据进行转换,并建立了数据之间的关联,实现了数据整合,是目前在食品微生物领域以语义网方式建立的第一个数据库。在该平台中,实现了将食品微生物的物种、菌株层面的宏观信息与基因组、蛋白质、代谢与功能等微观层面信息的贯通,并通过友好的数据检索界面,为用户进行食品微生物研究提供了重要的工具。
With the rapid development of next generation sequencing technology and the researches on fermentation mechanism of food microorganism, data and knowledge of food microorganisms increased enormously, including genomic, metagenomics, metabolic and phylogenetic information. These data are distributed from different resources with various data formats. An integrated data platform is necessary for better understanding of biological knowledge from such growing heterogeneous data. As a result,we construct a food microorganism database using semantic wcb technology. We describe information of gene,genome sequences,gene ontology,protein sequences and structures,pathway and enzyme in the form of Resource Description Framework( RDF) from a wide range of open data resources. In this database,physiological information of microbes from culture collections could be linked to the genomic information and further linked to the metabolic information which allows flexible queries across different domains. User-friendly interfaces of the database provide the ability to answer a number of food microorganisms research related questions based on the linked data.