东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于随机投影的并行数据流聚类方法

期刊名称：模式识别与人工智能
时间：0
页码：113-122
语言：中文
分类：TP391[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]复旦大学计算机与信息技术系,上海200433, [2]宁波大学信息科学与工程学院,宁波315211
相关基金：国家自然科学基金项目（No.60773072）、浙江省自然科学基金项目（No.Y104144）和浙江省教育厅项目（No.20051737）资助
相关项目：可信赖Web服务选择与协作关键技术研究

作者：陈华辉|

关键词：概要结构, 遗忘特性, 随机投影, 数据流, Synopsis, Amnesic Feature, Random Projection, Data Stream

中文摘要：

利用数据流的遗忘特性，应用随机投影，分层、动态地维护每个数据流的概要结构．基于该概要结构，快速计算数据流和聚类中心之间的近似距离，实现一种适合并行多数据流的K-means聚类方法．所进行的实验验证该方法的有效性．

英文摘要：

A synopsis is maintained dynamically for each data stream. The construction of the synopsis is based on random projections and it utilizes the amnesic feature of data stream. Using the synopsis, the approximate distances between streams and the cluster center can be computed fast. And an efficient online version of the classical K-means clustering algorithm is developed. The experimental results showy the method can be performed effectively with a good clustering quality.

同期刊论文项目

可信赖Web服务选择与协作关键技术研究

期刊论文 14 会议论文 6

同项目期刊论文

数据流上具有数据遗忘特性的小波概要

基于小波概要的并行数据流聚类.

一种基于概念格的用户兴趣预测方法

基于OWL-S 的Web 服务匹配研究

基于混合模型语义Web 服务匹配机制的研究与实现

Combining Gene Sequence Similarity and Textual Information for Gene Function Annotation in the Liter

An effective and efficient results merging strategy for multilingual information retrieval in federa

Latent semantic analysis for query interfaces of deep web sites

时间序列流的分层段模型