Skyline query processing over distributed networks has recently received considerable attention in the information retrieval community. However, most existing work does not consider how to effectively reduce the network transfer cost of skyline queries in such networks. In this paper, we propose RTCSQDN (Reducing the Transferring Cost of Skyline Queries over Distributed Networks), an efficient, sound, and complete algorithm that balances the cost of transferring data against the cost of skyline computation on distributed networks. Specifically, RTCSQDN exploits the semantic relationship between parent-space skylines and child-space skylines and transfers the data in three phases. Moreover, we present a novel policy, PTGPV (Policy for Transferring Grouping Position Values), for transferring multi-dimensional objects. Rather than sending the skyline objects themselves, PTGPV transfers only the position values of objects for most dimensional spaces, and thereby minimizes the volume of data transferred. Detailed theoretical analysis and extensive experiments demonstrate that the proposed algorithms are both efficient and effective.
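For readers unfamiliar with skyline queries, the following minimal Python sketch illustrates the dominance test and one well-known relationship between a parent space and one of its child spaces: when no two objects share a value on any dimension, every child-space skyline object is also a parent-space skyline object. This is only an illustration under stated assumptions. Smaller values are assumed to be better, the sample data and the function names `dominates` and `skyline` are hypothetical, and the abstract does not say whether RTCSQDN relies on exactly this property. The three transfer phases and the PTGPV policy are not reproduced here.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(p: Point, q: Point, dims: Sequence[int]) -> bool:
    """Return True if p dominates q on the given dimensions.

    Assumption: smaller is better. p dominates q if p is no worse on every
    dimension in dims and strictly better on at least one of them.
    """
    return all(p[d] <= q[d] for d in dims) and any(p[d] < q[d] for d in dims)


def skyline(points: Sequence[Point], dims: Sequence[int]) -> List[Point]:
    """Naive quadratic skyline: keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p, dims) for q in points)]


# Hypothetical sample data; values are distinct on every dimension.
points = [(1, 9, 4), (3, 7, 2), (5, 5, 6), (7, 3, 8), (9, 1, 10), (8, 8, 9)]

full_sky = skyline(points, dims=(0, 1, 2))  # parent space {0, 1, 2}
sub_sky = skyline(points, dims=(0, 2))      # child space {0, 2}

# With distinct values, the child-space skyline is contained in the
# parent-space skyline.
print(set(sub_sky) <= set(full_sky))
```

A containment of this kind suggests why a child-space result can sometimes be answered from parent-space skyline objects that have already been transferred, which is one plausible way to save network traffic.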