针对传统K—means聚类算法的聚类结果易随不同的初始聚类中心波动的问题,采用最大距离积法优化K—means聚类算法的初始聚类中心。传统的K—means聚类算法都假定样本的各维特征对聚类的贡献相同,影响了聚类效果和模型估计精度。为了考虑样本各维特征对聚类的不同影响,利用一种新型的特征加权K-means聚类算法逐步调整特征权值,最终有效改善了聚类效果。利用本文方法建立组合支持向量机模型,将其用于双酚A生产过程质量指标的软测量建模中,仿真结果表明该算法能够有效改进数据的分类效果并提高软测量模型的估计精度。
Clustering results of traditional K-means clustering algorithm easily fluctuates with random initializing cluster centers. In order to solve this problem, maximum distances product algorithm is adopted. Secondly, clustering assumes that each feature of the samples is the same to the contribution of clustering. Therefore, a new feature-weighted K-means clustering algorithm is proposed which adjusts feature weights gradually on the basis of traditional K-means flustering, and improves the clustering result finally. A combination model based on support vector machine(SVM) is established and is applied to a soft sensor modeling for the quality index in a Bisphenol A production process. The simulation result shows that the data classification result is effectively improved by the algorithm, and the estimation precision is improved.