传统的聚类算法大都得到了样本集的一个划分,类之间是严格的互斥关系,而现实世界中类与类之间往往没有明确的边界。该文将粗糙集理论引入到聚类分析中,提出了一种基于覆盖的粗糙聚类算法KMMRSC,它用多个中心点代表一个类,并用上、下近似来刻画样本的归属,类与类之间是一种覆盖关系。实验结果表明,该算法聚类质量优于k-均值算法,且能发现非球状簇。
Most of traditional clustering algorithms get a partition of sample set with mutually exclusive classes, while there is no explicit boundary between classes mostly in the real world. Introducing rough set theory into clustering analysis, this paper proposes a kind of overlapping-based rough clustering algorithm called KMMRSC which represents a class with multiple centroids and describes the belongingness of samples with the concepts of upper approximation and lower approximation, thus there is overlapping relationship between classes. Experiments show that the algorithm KMMRSC, which can find non-spherical clusters, outperforms classic k-means.