东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

数据中心网络高效数据汇聚传输算法

ISSN号：0254-4164
期刊名称：《计算机学报》
时间：0
分类：TP393[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]数学工程与先进计算国家重点实验室,江苏无锡214125, [2]解放军信息工程大学国家数字交换系统工程技术研究中心,郑州450002, [3]信息系统工程国防科技重点实验室(国防科学技术大学),长沙410073
相关基金：本课题得到国家“九七三”重点基础研究发展规划项目青年科学家专题项目（2014CB347800）、国家自然科学基金优秀青年基金（61422214）、国家自然科学基金（91430214）、国家“八六三”髙技术研究发展计划项目基金（2013AA01A213）资助.

关键词：数据中心, 数据汇聚, 网内聚合, 混洗传输, incast树, data center, data aggregation, in-network aggregation, shuffle transfer, incast tree

中文摘要：

在数据中心中,类MapReduce的分布式计算系统在数据的混洗阶段产生巨大流量,令数据中心的东西向网络资源成为瓶颈.将这些高度相关的数据流在接收端进行聚合是分布式计算的通用处理方式,为了降低网络通信量并有效利用带宽,文中采用网内关联性流量的汇聚传输策略,将混洗和汇聚并行化,达到进一步降低东西向网络资源消耗、缩短混洗阶段延迟的目的.目前提出的IRS-based算法在适用场景上有一定局限性,为了解决这一问题,文中首先在以服务器为中心的代表结构BCube上建立incast最小树模型,分别提出MIB-based算法和MC-based算法,仅根据已知拓扑结构和发送节点编号即可快速生成一棵近似的最小代价incast树.MIB-based算法针对发送节点强关联的情况,使高层发送节点尽可能汇聚到已有的低层发送节点构建incast树;MC-based算法针对发送节点松散关联的情况,将节点进行最大程度上的聚合,通过增加最少的汇聚点完成incast树的构建.随后将上述两种算法结合起来进一步提出适用于各种场景的M2-based算法,通过推算时间复杂度证明该算法能够满足在线构建incast树的需求.最后,详细分析了M2-based算法对其他数据中心网络结构的适应性以及网内汇聚传输能够减少作业完成时间的原理.小规模实验结果表明,在不同网络规模下,M2-based比IRS-based节省了网络中约3%的数据量,整个作业在混洗和Reduce阶段的等待时间比不采用网内汇聚缩短约2/3;在不同传输节点规模下,M2-based比IRS-based节省了网络中约19%的数据量,整个作业在混洗和Reduce阶段的等待时间比不采用网内汇聚缩短约3/4.

英文摘要：

In data centers, distributed computing systems like MapReduce produce massive amount of traffic across successive processing stages. Such shuffle transfers make east-west network resource become a bottleneck. In many commonly used workloads, data flows from all senders to each receiver are typically highly correlated. Many state-of-the-practice systems thus already apply aggregation functions at the receiver side of a shuffle transfer to reduce the output data size. To lower down the network traffic and efficiently use network bandwidth, we introduce in-network aggregation for associated traffic and parallelize the shuffle and reduce phases. It can significantly reduce consuming the rare east-west network resource, and avoid long latency time produced by the shuffle phase in MapReduce jobs. IRS-based algorithm proposed currently has certain limitations. To solve this problem, we first built a model for incast minimal tree with BCube, a representative server-centric networking structure for future data centers, and propose two approximate incast tree construction methods named MIB-based and MC-based, respectively, solely based on the labels senders and the data center topology. MIB-based method is applied to the case of highly correlative senders. It can build an incast minimal tree by making an endeavor to aggregate the high-level senders to low-level senders. MC-based method is applied to the case of loose associative senders. It can build an incast minimal tree by aggregating nodes as far as possible and increasing the least nodes. Then we combined two methods and further proposed M2-based method for any case. It proved that the method we proposed can meet the demand of building the incast tree on line by calculating the time complexity of the M2-based incast tree building method. At last, we analyzed the adaptability of M2-based to other data center structures, and the principle of in-network aggregation in reducing the job execution time. The small-scale experimental results show that, in the different size o

同期刊论文项目

面向100PF级计算机的三类共性算法研究及高效实现

期刊论文 7

　软件定义的云数据中心网络基础理论与关键技术

期刊论文 7

网络系统架构与资源管理基础理论和关键技术研究

期刊论文 3

同项目期刊论文

MDCent:一种高可扩展、高吞吐量的模块间互连结构

GPU加速不完全Cholesky分解预条件共轭梯度法

Comparing Set Reconciliation Methods Based on Bloom Filters and Their Variants

数据中心内Incast流量的网内聚合研究

A Fully Pipelined Probability Density Function Engine for Gaussian Copula Model

A survey of network update in SDN

Comparing Set Reconciliation Methods Based on Bloom Filters and Their Variants

软件定义的云数据中心网络基础理论与关键技术

SwiftArray： Accelerating Queries on Multidimensional Arrays

数据中心网络的研究进展与趋势

面向神威·太湖之光的PETSc可扩展异构并行算法及其性能优化

众核处理器的流水线紧耦合指令循环缓存设计

基于聚集混合粗化的代数多重网格并行算法

基于蚁群平台的大规模分布式XML数据库

期刊信息

《计算机学报》
北大核心期刊（2011版）

主管单位:中国科学院
主办单位:中国计算机学会中国科学院计算技术研究所
主编：孙凝晖
地址：北京中关村科学院南路6号
邮编：100190
邮箱：cjc@ict.ac.cn
电话：010-62620695

国际标准刊号：ISSN：0254-4164
国内统一刊号：ISSN：11-1826/TP
邮发代号:2-833

获奖情况:
中国期刊方阵“双效”期刊

国内外数据库收录:
美国数学评论（网络版）,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:48433