东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于约束信息的并行k-means算法

ISSN号：1001-0505
期刊名称：东南大学学报(自然科学版)
时间：0
页码：505-508
分类：TP391.4[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]南京航空航天大学信息科学与技术学院,南京210016, [2]江苏科技大学计算机科学与工程学院,镇江212003
相关基金：国家高技术研究发展计划（863计划）资助项目（2006AA12A106）; 国家自然科学基金资助项目（60903130）
相关项目：真实场景下异常检测若干关键技术及其入侵检测应用研究

作者：於跃成|王建东|郑关胜|陈斌|Yu Yuecheng|Wang Ji|ong|Zheng Guansheng|Chen Bin|

关键词： K-means, 并行k-means, 约束聚类, 约束并行k-means, k-means, parallel k-means, constrained clustering, constrained parallel k-means

中文摘要：

为获得分布式数据集上用户所期望的聚类结果,提出了基于约束信息的并行k-means聚类算法.在分析并行k-means能够有效实现对水平分布式数据集进行聚类的基础上,修改并行k-means算法的目标函数,设计约束并行k-means算法,将站点用户的约束信息以chunklet的形式引入到分布式聚类过程,从而引导算法执行有偏搜索.约束并行k-means算法在理论上保证无约束样本簇内距离最小的同时能够确保chunklet约束中的样本与对应的簇中心之间的平均距离最小.实验结果表明,约束并行k-means算法能够有效改善并行k-means的聚类精度,同时在分布式环境下能够得到与已有约束聚类算法在集中式数据集上相等价的聚类结果.

英文摘要：

In order to obtain the desired clustering results on the distributed data set,a parallel k-means algorithm is presented based on constrained information.On the basis of the facts that the parallel k-means algorithm can be effectively used in clustering the horizontal distributed data set,the objective function of the parallel k-means algorithm is modified,and the constrained parallel k-means algorithm is designed,then the constrained information of site users is introduced into the distributed clustering process in the form of chunklets,which can guide the algorithm to a bias search.Theoretically the algorithm guarantees the inter-cluster distance among the unconstrained samples to be the closest,and guarantees the average distance between constrained samples in a chunklet and the corresponding cluster center to be the closest one.The results from the experiments show that the algorithm can effectively enhance the clustering precision of parallel k-means,meanwhile it can obtain the clustering results on the distributed data set,which are equivalent to the results of the constrained k-means algorithm running on a centralized data set.

同期刊论文项目

真实场景下异常检测若干关键技术及其入侵检测应用研究

期刊论文 20 会议论文 6

同项目期刊论文

马氏度量学习中的几个关键问题研究及几何解释

Task schedulable problem and maximum scheduling problem in a multi-agent system

A Petri-net and QoS Based Model for Automatic web Service Composition

基于连通支配集的虚拟骨干网构造算法

基于连通坡面划分的多重虚拟骨干网轮换算法

Research of the traffic characteristics for the real time online traffic classification

Mobile ontology

An overview of CRS4MO project: Construction and retrieval system for mobile ontology

A Policy-based adaptive web services security framework

WSN中一种基于移动锚节点的节点定位算法

结构大间隔单类分类器

异常检测综述

基于连通支配树的异构传感器网络拓扑修复算法

Plane-Gaussian artificial neural network

基于约束正则化的生成聚类分析

隐目标回归算法设计研究

SPATIAL REGULARIZATION OF CANONICAL CORRELATION ANALYSIS FOR LOW-RESOLUTION FACE RECOGNITION

期刊信息

《东南大学学报：自然科学版》
中国科技核心期刊

主管单位:教育部
主办单位:东南大学
主编：毛善锋
地址：南京四牌楼2号
邮编：210096
邮箱：xuebao@seu.edu.cn
电话：025-83794323

国际标准刊号：ISSN：1001-0505
国内统一刊号：ISSN：32-1178/N
邮发代号:28-15

获奖情况:
先后荣获第三届国家期刊奖百种重点期刊奖，2006-2...,2013年荣获首届江苏省新闻出版政府奖"报刊奖"

国内外数据库收录:
美国化学文摘（网络版）,美国数学评论（网络版）,德国数学文摘,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:23651