基于有限混合模型的聚类算法具有以下缺陷:聚类结果依赖于模型的初始化参数;聚类结果容易收敛于局部最优;聚类过程无法决策聚类数量。为了解决这些问题,提出了一种基于负补偿函数的自由能量方程,对此方程的训练会产生类似于模拟退火的效应,增大了获得全局最优聚类的可能性。提出了一种基于补偿函数的泛化模型选择方法以用于聚类数量决策。提出了一种基于聚类重叠度的动态控制法以用于权衡退火效应以及聚类数量的决策。实验结果表明,新算法的聚类性能明显优于其他传统算法。
Drawbacks of finite-mixture-model-based clustering algorithms are as follows:the dependence on the choices of initial parameters;the convergence to a local optimum;the incapability of cluster number determination.To tackle these problems,a new model is developed by introducing a penalty function into the negative free energy function.The learning of this model presents a similar annealing process as that of the simulated-annealing algorithm,which may increase the chance of global optimization.A generalized model selection method is introduced for cluster-number determination.To avoid either a sudden temperature decay in an annealing process or an over-or under-estimation in the number of clusters,the weight of penalty function is adaptively controlled by a novel method based on the overlapping degree between clusters.Experimental results show that the proposed approach outperforms the conventional algorithms.