为识别和改进数据中存在的质量问题,采用Benford法则进行数据质量挖掘分析,该方法通过分析数字分布规律来检测数据的合理性,达到控制数据质量的目的.以水文数据中降水量数据为样本验证方法的有效性.实验结果表明,该方法能有效识别数据集中存在异常信息,提高了水文数据的数据质量,具有一定的应用前景.
In order to identify and improve the data quality problems,this paper adopts a data quality mining methods based on Benford's Law.This method detects the rationality of data through analyzing the distribution of the data to reach the goal of data quality control.Finally,we used precipitation data as a sample to verify the validity of this method.The results show that the method can effectively identify the abnormal data,improving the data quality,and has certain application prospect.