针对现实世界中数据的杂乱性、冗余性和不完整性而提出了一种基于遗传算法的组合数据清理方法。利用这种方法,使得数据在挖掘之前具有更好的完整性和一致性,可以提高数据挖掘模式的总体质量和减少实际挖掘所需要时间。以学生成绩作为试验对象进行了分析和研究,有效地去除了数据中的噪声,试验结果证明了此方法的可行性和有效性。
A data preprocessing combination algorithm was put forward to avoid imperfection and inconsistency of data in real world.This algorithm made data more cleaning and integrated before data mining,which improved data mining quality and reduces data mining time.A test applies in students grade research was set up and gets valid results.