针对“数据密集型科学发现”的e-Science环境下,科学活动对数据质量要求越来越严格,有必要建立一套基于过程的数据质量管理模型,为推进基于数据的科研探索活动提供更为优质的科学数据。在全面数据质量管理模型的基础上,将IS08000系列标准、数据管理、大数据技术平台以及人力资源等相关要素纳入到数据质量管理的范畴,构建了科学数据质量控制的过程参考模型,进而解析了模型的基本原则、流程及各子模型的详细构成。与其他模型相比较,该过程模型充分体现了全面质量管理和质量源于预防的思想,并通过动态的数据质量管理循环,确保数据质量的持续改进。该模型有助于科学数据质量管理的思想方法和工作步骤更加条理化、系统化、图像化和科学化。
Data requirements on scientific activities are more and more stringent in the " data intensive scientific discovery" e-Science environment. It is necessary to establish a set of data quality management model based on process, in order to provide more qualitative scientific data for promoting the scientific research activities based on the data. A process reference model of scientific data quality management is constructed on the basis of total data quality management model, combining with other factors related to data quality management category, such as the ISO9000 series standards, data management, technology platform and human resources. And then basic principles, process and sub models of the reference model are discussed in details. Comparing with other models, this process reference model embodies the thought of total quality management and quality prevent by using the dynamic cycle in data quality control to ensure continuous improvement of data quality. The model can help to make the scientific data quality management process more orderly, systematic and scientific.