针对传统的局部离群点检测算法中存在离群点判定的主观性过强的问题,通过研究局部离群点检测相关算法,提出了一种基于数据集对象平均离群因子的离群点选择算法.该算法首先求得各数据对象的邻域对象,进而根据邻域数据集合求出各自的离群因子,在进行离群点选择时计算出数据集的平均离群因子,将每个对象的离群因子与平均离群因子进行比较判断对象是否为离群点.理论分析和实验对比结果均表明,该算法在进行离群点选择时可以有效地避免离群点选择的主观性,更好地提高检测的准确率.
Aiming at the shortage of subjectivity existed in the traditional local outlier detection algorithm,we studied the relevant algorithm of local outlier detection,and a local outlier selection algorithm based on the average outlier factor was proposed.This algorithm calculates the neighborhood of each data object,and figures out the outlier factor according to the dataset of its neighborhood.Comparing the outlier factor of each object with the average outlier factor to determine whether the object is an outlier.Theoretic analysis and experimental results show that this algorithm can avoid the subjectivity of selection and improve the accuracy of selection.