ETL是数据仓库获得高质量数据的重要环节,在数据仓库建设过程中占有极其重要的地位。为了便于ETL过程的设计、维护和修改,提出一种基于结构图的ETL过程建模方法,并依据该方法完成了ETL概念模型的设计。通过图形化ETL过程中的元素和关联,该模型清晰直观地反映了数据的来源和流向、源数据与目标数据之间的映射和转换关系,辅助设计人员更好地进行ETL过程的设计和ETL过程的编码实现,使整个ETL设计过程更加方便、灵活。
ETL is an important part for the data warehouse to gain data with high quality, and it plays a key role in building the data warehouse system. The paper proposed a methodology for modeling ETL process based on an architecture graph, with the goal of facilitating the designment, maintenance and modification of the ETL process. On the basis of this modeling approach, the paper completed the design of ETL conceptual model. By representing the elements and relationships of the ETL process diagrammatically, it expressed the data' s coming and going, as well as their mapping and transformation relationships clearly and intuitively. It also supported the ETL designer to design the ETL process and develope the code efficiently, and improved the flexibility and reliability of the ETL process designment greatly.