With the spread of the Internet and the explosive growth of data of all kinds, effectively integrating and analyzing heterogeneous data has become a key problem in e-commerce. This paper proposes a solution built on a mediator-wrapper architecture and based on XML and the Resource Description Framework (RDF). The architecture uses XML Schema to represent the data schemas of heterogeneous data sources, and achieves data integration by using RDF to establish mappings between these schemas. Based on this solution, we also implement a prototype of a general heterogeneous data integration framework. In experiments, the framework handles both the diversity of data sources and the semantic conflicts between schemas reasonably well, and it remains flexible.
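The mediator-wrapper idea summarized above can be illustrated with a minimal sketch. This is not the authors' prototype: the source names (`shopA`, `shopB`), the `ex:mapsTo` predicate, the `global:` terms, and the `wrap` and `mediate` functions are all hypothetical, chosen only for illustration. The mappings are written as plain (subject, predicate, object) tuples in the spirit of RDF statements rather than with a real RDF library, and the XML is parsed without XML Schema validation.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping triples (subject, predicate, object), in the spirit of
# RDF statements: each links a source-specific field to a shared global term.
MAPS_TO = "ex:mapsTo"  # made-up predicate name, assumed for this sketch
MAPPINGS = [
    ("shopA:item/title", MAPS_TO, "global:productName"),
    ("shopA:item/cost", MAPS_TO, "global:price"),
    ("shopB:product/name", MAPS_TO, "global:productName"),
    ("shopB:product/price", MAPS_TO, "global:price"),
]


def wrap(source, xml_text):
    """Wrapper: expose one source's records as {source-qualified path: text}."""
    root = ET.fromstring(xml_text)
    return [
        {f"{source}:{record.tag}/{field.tag}": field.text for field in record}
        for record in root
    ]


def mediate(wrapped_records):
    """Mediator: rename source-specific fields to global terms via the triples."""
    lookup = {s: o for s, p, o in MAPPINGS if p == MAPS_TO}
    return [
        {lookup[key]: value for key, value in record.items() if key in lookup}
        for record in wrapped_records
    ]


# Two toy sources that describe the same kind of data with different schemas.
SHOP_A = "<items><item><title>Lamp</title><cost>12.50</cost></item></items>"
SHOP_B = "<products><product><name>Desk</name><price>80.00</price></product></products>"

integrated = mediate(wrap("shopA", SHOP_A) + wrap("shopB", SHOP_B))
print(integrated)
```

In this sketch, adding a new data source only requires a new wrapper call and new mapping triples; the mediator itself does not change, which is one plausible reading of the flexibility the abstract describes.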