To simplify extracting and transforming data from multiple heterogeneous data sources and loading it into a target database, a fairly general extraction method is proposed and a software tool is developed. The method is client-based: the extraction software acts as a client of each data source and accesses it through the database driver supplied by that source's vendor, which hides the differences between heterogeneous databases and keeps the software independent of any specific source. The extraction software itself performs the extraction, transformation, and loading. Extraction and transformation are driven by extraction rules, which are created in a visual editor and stored in a rule base so that they can be reused and modified. Multiple extraction tasks form an extraction task queue, and the results of each run are saved for later reference. The resulting tool, DbBridge, supports extracting data from sources such as Oracle, SQL Server, SAS, Access, and Excel. It has been applied successfully to the construction of a data warehouse and to data migration work.
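The rule-driven, queue-based flow described above can be illustrated with a minimal sketch. It is not DbBridge's actual implementation: the names `ExtractionRule`, `run_rule`, and `run_queue` are hypothetical, the rule fields are assumed for illustration, and Python's built-in `sqlite3` module stands in for the vendor-supplied drivers so that the example is self-contained.

```python
import sqlite3
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ExtractionRule:
    """Hypothetical rule: which source query feeds which target table."""
    name: str
    source_query: str
    target_table: str
    target_columns: List[str]
    # Optional per-column transforms, keyed by target column name.
    transforms: Dict[str, Callable] = field(default_factory=dict)


def run_rule(rule, source, target):
    """Extract rows from source, transform them, load them into target."""
    rows = source.execute(rule.source_query).fetchall()
    identity = lambda value: value
    converted = [
        tuple(rule.transforms.get(col, identity)(val)
              for col, val in zip(rule.target_columns, row))
        for row in rows
    ]
    columns = ", ".join(rule.target_columns)
    placeholders = ", ".join("?" for _ in rule.target_columns)
    target.executemany(
        f"INSERT INTO {rule.target_table} ({columns}) VALUES ({placeholders})",
        converted,
    )
    target.commit()
    return len(converted)


def run_queue(rules, source, target):
    """Run rules in FIFO order and record each outcome for later reference."""
    queue = deque(rules)
    results = {}
    while queue:
        rule = queue.popleft()
        try:
            results[rule.name] = ("ok", run_rule(rule, source, target))
        except sqlite3.Error as exc:
            results[rule.name] = ("failed", str(exc))
    return results


def demo():
    """Build two in-memory databases and run one assumed rule between them."""
    source = sqlite3.connect(":memory:")
    target = sqlite3.connect(":memory:")
    source.execute("CREATE TABLE staff (id INTEGER, name TEXT)")
    source.executemany("INSERT INTO staff VALUES (?, ?)",
                       [(1, " ann "), (2, "bob")])
    target.execute("CREATE TABLE dim_staff (staff_id INTEGER, staff_name TEXT)")
    rule = ExtractionRule(
        name="staff_to_dim",
        source_query="SELECT id, name FROM staff ORDER BY id",
        target_table="dim_staff",
        target_columns=["staff_id", "staff_name"],
        transforms={"staff_name": lambda s: s.strip().title()},
    )
    results = run_queue([rule], source, target)
    loaded = target.execute(
        "SELECT staff_id, staff_name FROM dim_staff ORDER BY staff_id"
    ).fetchall()
    return results, loaded
```

Calling `demo()` runs the single rule and returns the per-rule results together with the rows loaded into the target table. In a real deployment the two connections would presumably come from different drivers (for example an Oracle driver on one side and a SQL Server driver on the other), and the rules would be loaded from a rule base rather than constructed in code.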