位置:成果数据库 > 期刊 > 期刊详情页
基于穿行次数的大规模图数据路径查询
  • 期刊名称:计算机研究与发展
  • 时间:0
  • 页码:96-103
  • 语言:中文
  • 分类:TP311.13[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]北京大学信息科学技术学院计算机科学技术系,北京100871
  • 相关基金:基金项目:国家自然科学基金项目(60873062);国家“八六三”高技术研究发展计划基金项目(2007AA012191,2006AA012230)
  • 相关项目:大规模图数据正则路径查询关键技术研究
中文摘要:

在涉及复杂图(graph)数据的场景中,图的距离查询和路径查询有着重要的应用.有些应用涉及到规模巨大的图,并且要求快速的查询响应.为此需要高效的查询策略.通过研究可以发现,图内部节点的重要程度往往是不同的,并且可以利用节点的"穿行次数"度量节点的重要性.根据穿行次数为节点构建标签,并保证仅根据节点标签就能处理图的距离查询和路径查询,从而避免对图的遍历,这是一个基本的查询策略.这些标签的规模要尽量小,以降低空间开销、提高查询速度 而其构建过程却要足够快,以保证构建效率.将这个基于穿行次数的查询处理策略称为"穿行次数算法",最终的实验结果验证了该算法的有效性.

英文摘要:

Distance and path queries in graphs are fundamental to numerous applications,ranging from geographic navigation systems to Internet routing. Some of these applications involve huge graphs and yet require fast query answering. A new data structure is created for representing all distances in a graph. The data structure is distributed in the sense that it may be viewed as assigning labels to the vertices,such that a query involving vertices u and v may be answered using only the labels of u and v. In this paper,a new concept "pass count" is proposed to measure the importance of a vertex in the graph. Based on the pass-count,a special heuristic rule is used to build distance labels for each vertex of the graph,so distance queries and path queries can be processed on those labels exclusively. This method is efficient by avoiding graph traversal,and hence reduces time complexity. A "pass count algorithm" is also presented based on the pass-count of each vertex in the graph,which aims to minimize labels' size,improve the query processing,and accelerate the construction of the labels. Extensive experiments are conducted to prove the effectiveness and efficiency of the algorithm.

同期刊论文项目
同项目期刊论文