东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

一种有效的海量数据Top—k Dominating查询算法

ISSN号：0254-4164
期刊名称：《计算机学报》
时间：0
分类：TP311[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术]
作者机构：[1]哈尔滨工业大学计算机科学与技术学院,哈尔滨150001
相关基金：国家“九七三”重点基础研究发展规则项目基金（20i2CB316200）、国家自然科学基金（61190115,61173022,61033015,60831160525,61272046,60903016）和哈尔滨工业大学科研创新基金项目（HIT.NSRIF.2014136）资助.

关键词：海量数据, TOP-K, dominating查询, TDEP算法, 有序列表, 早剪切操作, massive data, top-k dominating query, TDEP algorithm, sorted lists, early pruning

中文摘要：

在多准则决策支持等多个应用中，top-k dominating查询是一种十分实用的查询，它在潜在的巨大的数据空间中返回k个支配分数最大的元组.现有算法，要么需要为特定的属性组合构建索引，要么需要较大的I/O费用或内存费用，从而无法有效处理海量数据上top-k dominating查询.文中提出一种新的查询算法TDEP，该算法利用以较小代价为每个属性构建的有序列表来有效返回海量数据上的top-k dominating查询结果.文中将TDEP算法的执行明确地分为两个阶段：增长阶段和收缩阶段.在每个阶段，TDEP算法以round-robin方式读取涉及到的有序列表并维护候选元组，直到满足结束条件.文中分析了两个阶段的执行行为，提出一种新的不需要重新读取有序列表的支配分数计算方法.同时，文中还提出有效的早剪切操作，可以有效减少TDEP算法需要维护的候选元组数量.实验结果表明：和现有算法相比，TDEP算法具有较大的性能优势.

英文摘要：

In many applications like multi-criteria decision making, top-k dominating is a practically useful tool to return k tuples with the highest domination scores in a potentially huge data space. The existing algorithms, either requiring indexes built on the specific attributes, or incurring high I/O cost or memory cost, cannot process top-k dominating query on massive data efficiently. In this paper, a novel algorithm TDEP is proposed to utilize sorted lists built for each attribute with low-cost to return top-k dominating results on massive data efficiently. Through analysis, it is found that TDEP is executed in two phases： growing phase and shrinking phase. In each phase, TDEP retrieves the sorted lists in a round-robin fashion and maintains the candidates until the termination condition is satisfied. The theoretical analysis is provided for the execution behavior in two phases. An efficient method is developed to compute the domination scores for tuples without re-retrieving the sorted lists. Besides, TDEP adopts early pruning to reduce the number of candidate tuples maintained. The extensive evaluation results show the significant performance advantage of TDEP over the existing algorithms.

同期刊论文项目

云计算中TB/PB级海量数据近似查询处理技术的研究

期刊论文 4

低能耗海量数据管理理论与关键技术研究

期刊论文 22 会议论文 5

基于云计算环境的TB/PB级海量数据查询处理技术的研究

期刊论文 17 会议论文 1

非确定传感网数据整合

期刊论文 31

信息物理融合系统(CPS)的基础理论和关键技术研究

期刊论文 103 会议论文 42 获奖 2 著作 1

持续进化型感知数据量质融合管理的基础理论与关键技术研究

期刊论文 16

同项目期刊论文

Finding the Cost-Optimal Path with Time Constraint over Time-Dependent Graphs

Novel ε-Approximation to Data Streams in Sensor Networks

Mining most frequently changing component in evolving graphs

云系统中能量有效的数据摆放算法和节点调度策略

TJJE: 海量数据上高效top-k连接算法

Efficient Top-k Retrieval on Massive Data

TDEP: Efficiently processing top-k dominatingquery on massive data

SEPT: an efficient skyline join algorithm onmassive data

异构信息网上的可达查询

Minimized-CostCube Query in Heterogeneous Information Networks

大数据上高效Skyline计算

一种基站可移动传感器网络再编程协议

RM树：一种支持字符串相似性操作的索引

PAA：海量数据上一种有效的近似聚集查询算法

无线传感器网络中最小化通信开销的近似监测算法

多维代价图模型上最优路径查询问题的研究

时间依赖代价函数下的最优路径查询问题研究

基于实体的相似性连接算法

无线传感器网络中基于双阈值的分布式监测算法

Exact and approximate algorithms for the most connected vertex problem

On the Complexity of View Update Analysis and its Ap

A System for Cleaning Data with Certain Fixes

MPMC:一种无线传感器网络多信道多功率数据聚集调度算法

劣质数据库上阈值相似连接结果大小估计

异构信息网上的可达性查询

基于图压缩的k可达查询处理

无线传感器网络中ε-近似区域聚集算法

(ε, δ)-Approximate Aggregation Algorithms in Dynamic Sensor Networks

Adding regular expressions to graph reachability and pattern queries

Mining Frequent Subgraphs over Uncertain Graph Databases under Probabilistic Semantics

数据时效性判定问题的求解算法

一种WSN中基于模式匹配与多节点相关性分析的复杂事件检测算法

MovPro: Data Dissemination for Reprogramming in Wireless Sensor Networks with Mobile Sink

Efficient algorithms for supergraph query processing on graph databases

Finding multiple induced disjoint paths in general graphs

Secure and Efficient Control Transfer for IoT Devices

RSPEED：无线传感器网络中基于不确定延迟的可靠实时路由

On the complexity of sampling query feedback restricted database repair of functional dependency vio

数据时效性修复问题的求解算法

无线传感器网络中近似事件检测节点调度问题

无线网络中基于任务的实时传输调度策略

不确定图上期望最短距离的计算

Information quality-aware tracking in uncertain sensor network

Novel ε-Approximation to Data Streams in Sensor Networks

Approximate Physical World Reconstruction Algorithms in Sensor Networks

无线传感器网络中能量高效的Top-k监测算法

Supporting early pruning in top-k query processing on massive data

一种能量有效的双层传感器网络top-k安全查询机制

Grouping-Enhanced Resilient Probabilistic En-Route Filtering of Injected False Data in WSNs

CPS中基于多模态事件融合模型的最优覆盖问题

On the hardness of Labelled Correlation Clustering problem: a parameterized complexity view

Evaluating Entity-Description Conflict on Duplicated Data

Deadline Aware Retransmission Threshold Setting Protocol In Cyber-Physical Systems

Rule-Based Method for Entity Resolution

大数据的一个重要方面：数据可用性

Efficiently processing (p,ε)-approximate join aggregation on massive data

无线传感器网络中基于链路质量的路径延时分析

Curve Query Processing in Wireless Sensor Networks

无线传感器网络 (ε, δ) –近似Top-k查询处理算法

一种基站可移动传感器网络再编程协议

Reliability-Aware Power Adjustment in Air-Soil Wireless Sensor Networks

无线传感器网络中近似加权聚集算法

Efficient Skyline Computation on Big Data

e<span style="font-size: 10.5pt

Efficient Algorithms for Summarizing Graph Patterns

Approximate Aggregations in Structured P2P Networks

A Distributed and Kernel-Based Scheme for Location Verification in Wireless Sensor Networks

Model Based Adaptive Data Collection Algorithm in Wireless Sensor Networks

Minimizing Failure Probability of Data Packet Delivery With Delay Guarantee

一种局部相关不确定数据库快照集合上的概率频繁最近邻算法

基于实体描述属性技术的XML重复对象检测方法

<span lang="EN-US" style="font-size: 10.5pt; font-family: "Times New Roma

无线传感器网络中 <span style="font-size

Data Collection in Multi-Application Sharing Wireless Sensor Networks

一种基于不确定规则的数据时效性判定方法

Incremental Detection of Inconsistencies in Distributed Data

e</spa

<span style="font-family: 宋体; font-size: 10.5pt; mso-fareast-theme-font: major-fareast; mso-

时间依赖代价函数下的最优路径查询问题的研究

RB树：一种支持空间关键字近似查询的外存索引

无线传感器网络具有跟踪质量保证的节点选择算法

无线传感器网络中可容错的事件监测算法

复杂数据上的实体识别技术研究

PAA：海量数据上一种有效的近似聚集查询算法

无线传感器网络中最小化通信开销的近似监测算法

基于实体的相似性连接算法

ε-近似和加权公平性保证的无线传感器网络拥塞控制算法

无线传感器网络高可靠低维护地理路由协议

基于MapReduce的Skyline-join查询算法

低占空比无线传感器网络中基于动态切换的实时路由协议

混合无线传感器网络中的网关部署算法

BSCTC：传感网的基于切向约束的B样条等值线查询算法

基于图压缩的最大Steiner连通k核查询处理？

无线传感器网络数据收集问题综述

无线传感器网络关键技术研究

Distributed Aggregation Algorithms for Mobile Sensor Networks with Group Mobility Model

传感器网络基于小波分段常值压缩的数据收集研究

Minimum-Time Aggregation Scheduling in Duty-Cycled Wireless Sensor Networks

无线传感器网络中基于双阈值的分布式监测算法

异构信息网上的可达性查询

基于图压缩的k可达查询处理

RSPEED：无线传感器网络中基于不确定延迟的可靠实时路由

不确定图上期望最短距离的计算

无线传感器网络中能量高效的Top-k监测算法

无线传感器网络中基于链路质量的路径延时分析

一种基站可移动传感器网络再编程协议

PAA：海量数据上一种有效的近似聚集查询算法

无线传感器网络中最小化通信开销的近似监测算法

低占空比无线传感器网络中基于动态切换的实时路由协议

基于图压缩的最大Steiner连通k核查询处理？

无线传感器网络数据收集问题综述

无线传感器网络关键技术研究

异构信息网挖掘：概念、技术与未来

海量数据上的近似连接聚集操作

DBCC-Join: A novel cache-conscious disk-based join algorithm

TKEP:海量数据上一种有效的Top-K查询处理算法

TJJE: An efficient algorithm for top-k join on massive data

Ad-hoc aggregate query processing algorithms based on bit-store for query intensive applications in

RB树:一种支持空间近似关键字查询的外存索引

RM树：一种支持字符串相似性操作的索引

PAA：海量数据上一种有效的近似聚集查询算法

多维代价图模型上最优路径查询问题的研究

时间依赖代价函数下的最优路径查询问题研究

DBCC-Join：一种新的高速缓存敏感的磁盘连接算法

基于MPI的二维泊松方程差分并行实现与测试

云计算系统中查询处理及优化技术研究综述

外存中高效的字符串相似性查询处理

无线传感器网络中基于双阈值的分布式监测算法

传感器网络中一种基于多元回归模型的缺失值估计算法

海量数据上的近似连接聚集操作

TKEP:海量数据上一种有效的Top-K查询处理算法

无线传感器网络中一种近似Skyline查询处理算法

不确定图数据库中高效查询处理

从图数据库中挖掘频繁跳跃模式

不确定数据上两种查询的分布式聚集算法

传感器网络中一种基于时-空相关性的缺失值估计算法

无线传感器网络中ε-近似区域聚集算法

演变图上的连接子图演变模式挖掘

无线传感器网络一种不相交路径路由算法

无线传感器网络中能量高效的Top-k监测算法

一种基站可移动传感器网络再编程协议

无线传感器网络中可容错的事件监测算法

PAA：海量数据上一种有效的近似聚集查询算法

基于实体的相似性连接算法

ε-近似和加权公平性保证的无线传感器网络拥塞控制算法

无线传感器网络高可靠低维护地理路由协议

混合无线传感器网络中的网关部署算法

DBCC-Join：一种新的高速缓存敏感的磁盘连接算法

基于2-hop优化的子图模式匹配算法

XCluster：基于聚类支持查询的XML多文档压缩方法

无线传感器网络中（ε,δ）-近似聚集算法

基于子树匹配的相似xml连接方法的研究

无线传感器网络关键技术研究

Minimum-Time Aggregation Scheduling in Duty-Cycled Wireless Sensor Networks

PAA：海量数据上一种有效的近似聚集查询算法

基于任务合并的并行大数据清洗过程优化

外存中高效的字符串相似性查询处理

期刊信息

《计算机学报》
北大核心期刊（2011版）

主管单位:中国科学院
主办单位:中国计算机学会中国科学院计算技术研究所
主编：孙凝晖
地址：北京中关村科学院南路6号
邮编：100190
邮箱：cjc@ict.ac.cn
电话：010-62620695

国际标准刊号：ISSN：0254-4164
国内统一刊号：ISSN：11-1826/TP
邮发代号:2-833

获奖情况:
中国期刊方阵“双效”期刊

国内外数据库收录:
美国数学评论（网络版）,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:48433