东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

数据中心内Incast流量的网内聚合研究

ISSN号：1000-1239
期刊名称：《计算机研究与发展》
时间：0
分类：TP391[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：信息系统工程国防科技重点实验室(国防科学技术大学),长沙410073
相关基金：国家自然科学基金优秀青年科学基金项目（61422214）

关键词：网内聚合, 数据中心, incast传输, shuffle传输, 网络流量, in-network aggregation, data centers, incast transfer, shuffle transfer, network transfer

中文摘要：

MapReduce等分布式计算系统应用在数据中心内产生了严重的东西向流量，其中以incast和shuffle为代表的关联性流量占相当大的比重，进而严重影响到上层应用的性能．这促使研究者们考虑在这些关联性流量的网内传输阶段尽可能早而不是仅在流量的接收端进行流问数据聚合．首先以新型数据中心网络结构为背景讨论流间数据聚合的可行性和增益，为最大化该增益，为incast传输建立最小代价树模型．为解决该模型，提出了2种近似的incast树构造方法，其能够仅基于incast成员的位置和数据中心拓扑结构生成一个有效的incast树，进一步解决了incast树的动态和容错问题．最后，采用原型系统和大规模仿真的方法评估了incast流量的网内聚合方法，实验结果证明该方法能大幅降低incast流量造成的传输开销，能节约数据中心的网络资源．同时，提出的模型和解决方法也适用于其他类型的数据中心网络结构．

英文摘要：

Data transfers, such as the common shuffle and incast communication patterns, contribute most of the network traffic in MapReduce like working paradigms and thus have severe impacts on application performance in modern data centers. This motivates us to bring opportunities for performing the inter-flow data aggregation during the transmission phase as early as possible rather than just at the receiver side. In this paper, we first examine the gain and feasibility of the inter-flow data aggregation with novel data center network structures. To achieve such a gain, we model the incast minimal tree problem. We propose two approximate incast tree construction methods, RS-based and ARS-based incast trees. We are thus able to generate an efficient incast tree solely based on the labels of incast members and the data center topology. We further present incremental methods to tackle the dynamic and fault-tolerant issues of the incast tree. Based on a prototype implementation and large-scale simulations, we demonstrate that our approach can significantly decrease the amount of network traffic, save the data center resources, and reduce the delay o for BCube and FBFLY can be adapted to other data centers structures of job processing. Our approach with minimal modifications.

同期刊论文项目

网络系统架构与资源管理基础理论和关键技术研究

期刊论文 3

同项目期刊论文

数据中心网络高效数据汇聚传输算法

Comparing Set Reconciliation Methods Based on Bloom Filters and Their Variants

期刊信息

《计算机研究与发展》
中国科技核心期刊

主管单位:中国科学院
主办单位:中国科学院计算技术研究所
主编：徐志伟
地址：北京市科学院南路6号中科院计算所
邮编：100190
邮箱：crad@ict.ac.cn
电话：010-62620696 62600350

国际标准刊号：ISSN：1000-1239
国内统一刊号：ISSN：11-1777/TP
邮发代号:2-654

获奖情况:
2001-2007百种中国杰出学术期刊，2008中国精品科...,中国期刊方阵“双效”期刊

国内外数据库收录:
俄罗斯文摘杂志,荷兰文摘与引文数据库,美国工程索引,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:40349