随着信息技术的发展,Web上的数据日趋成为当今数据的主流,但是Web上的数据多是异构的,而越来越多的人需要访问各种异构数据,为了满足这种需求,必须有一种系统能够支持异构数据集成。异构数据集成的方法有很多,给出一个基于XML的虚拟法异构数据集成系统体系结构,引入虚拟法,提出用虚拟法进行异构数据集成;最后引入数据清洗技术,能够较好地解决异构数据的集成。
With the development of information technology, Web-based data becomes the mainstream of today' s data, but data on Web are mostly heterogeneous. More and more people need to access all kinds of heterogeneous data, so there must be a kind of system which can support heterogeneous data integration. There are many kinds of methods that can carry out heterogeneous data integration, but this paper introduced a kind of architecture of the virtual approach for heterogeneous data integration based on XML in order to solve the problem of heterogeneous data integration on Web. It proposed to use the virtual approach to carry out heterogeneous data integration, and finally introduced the technology of data cleaning which could solve the integration of heterogeneous data better.