With the rapid development of information technology and the deepening application of information systems, enterprises have accumulated large amounts of historical data that support their normal operations and decision-making. To make decisions more accurate and effective, the quality of this historical data must be assessed, and the data must then be cleansed based on the assessment results. This paper focuses on a data quality assessment method based on data quality constraints defined under data dimensions: it determines the dimensions of data quality assessment, defines the data quality constraints under each dimension, and presents a data quality assessment algorithm based on these constraints. The method has been applied in the data quality assessment project for the Daqing Oilfield production database and in the data quality assessment project for the financial system database of Hebei Hanguang Heavy Industry Co., Ltd.