东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

GPU加速的基于增量式聚类的视频拷贝检测方法

期刊名称：计算机辅助设计与图形学学报
时间：0
页码：449-456
语言：中文
分类：TP391[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]中国科学院计算技术研究所前瞻研究实验室,北京100190, [2]中国科学院研究生院,北京100049, [3]北京中医药大学信息中心,北京100029
相关基金：国家“九七三”重点基础研究发展计划项目（2007CB311100）;国家“八六三”高技术研究发展计划（2007AA01Z416）;国家自然科学基金（60873165,60802028）;北京市科技新星计划项目（2007B071）;北京市教育委员会共建项目专项.
相关项目：融合显式和隐含语义概念的视频检索技术研究

关键词：拷贝检测, 增量聚类, 视觉关键词, 图形处理器, 计算统一设备架构, copy detection, incremental clustering, visual words, GPU, CUDA

中文摘要：

为有效地保护版权，提高大规模视频集的拷贝检测速度，提出一种完全实现在GPU上的基于增量式聚类的拷贝检测方法．对数据库中新增加的视频，首先调用GPU上的硬件解码单元对视频流解码，以实时的速度提取高维SIFT特征点；然后对特征点进行增量K—means聚类，以动态地反映数据库的变化，并根据聚类结果更新视觉关键词词典；再将每帧表示成归一化的词频向量；最后使用基于帧级别词频向量的时空顺序匹配法来判定查询视频是否为数据库中视频的拷贝．实验结果表明，该方法比原有的CPU实现方法整体提速最高达63倍．

英文摘要：

For effectiveness of privacy protection and efficiency of copy detection on large video datasets, a fully GPU-based incremental copy detection scheme is proposed. When a newly added video arrives into the database, GPU on-chip decoder is called for video stream decoding. At the same time, high dimensional SIFT features are extracted on the frame in real-time, which is followed by an incremental K-means clustering method responding to the dynamic database used to update visual words codebook. Then, each frame is represented with a visual words frequency vector. Finally, a spatiotemporal sequence matching method based on visual words representation at frame level is used to determine whether the query is a copy. Experimental results show that our GPU implementation achieves up to a 63 times speedup over the CPU version.

同期刊论文项目