随着社交网络分析、语义Web分析、生物信息网络分析等新兴应用的快速增长,对亿万个顶点级别大规模图的处理能力的需求愈加迫切,这是当前高性能计算领域的研究和开发热点.文中结合云计算的特点,从图数据管理与图数据处理机制两个方面,综述了云计算环境下进行大规模图数据处理的关键问题,包括图数据的存储方式、图索引结构、图分割策略、图计算模型、消息通信机制、容错管理、可伸缩性、图查询处理等.全面总结了当前的研究现状和进展,详细分析了存在的挑战性问题,并深入探讨了未来的研究方向.
With the rapid growth of emerging applications like social network analysis,semantic Web analysis,and bioinformatics network analysis,it is urgent to require the processing capability on large scale graphs with billions of vertices,which is the hot topic of the research and development in the current high performance computing field.With the features of cloud computing and from the aspects of graph management and graph processing mechanisms,this paper surveys the key issues of large scale graph processing on cloud computing environments,including graph data storage scheme,index structure of graph data,graph partitioning strategy,graph computing model,message communication mechanism,fault-tolerance management,scalability,and graph query processing.This paper summarizes the state-of-art of current research works completely,analyzes the existing challenge problems in detail,and deeply explores the research directions in future.