位置:成果数据库 > 期刊 > 期刊详情页
面向神威·太湖之光的PETSc可扩展异构并行算法及其性能优化
  • ISSN号:0254-4164
  • 期刊名称:《计算机学报》
  • 时间:0
  • 分类:TP301[自动化与计算机技术—计算机系统结构;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]湖南大学信息科学与工程学院国家超级计算长沙中心,长沙410082, [2]江南计算技术研究所,江苏无锡214125
  • 相关基金:本课题得到国家自然科学基金重点项目(61432005,91430214)、国家杰出青年科学基金(61625202)、国家自然科学基金(61602166)、数学工程与先进计算国家重点实验室开放基金课题、国家重点研发计划(2016YFB0201402,2016YFB0201900)资助.
中文摘要:

共性数学库PETSc(Portable,Extensible Toolkit for Scientific Computation)是高性能计算的基础模块,是超级计算机计算环境的基础算法库之一,其性能直接影响调用数学库的高性能数值计算应用的效率.面向国际上首台100P神威·太湖之光异构超级计算机,根据实际研究需要选取PETSc中两个典型用例ex5(单节点线性求解方程组问题)和exl9(多节点求解2D驱动腔问题)进行实验探究.对运行结果分析找到的热点函数主要为PETSc函数库中7个核心函数,针对这7个核心函数(主要包括向量运算与矩阵运算),提出和实现了其异构并行算法,并结合机器的异构体系结构提出了相应的性能优化方法.在超级计算机上的实验结果为:核心函数并行算法在4主核、256从核的单节点上加速比最大可达到16.4;多节点情况下,当输入规模为16384时,8192个节点相对于256节点的加速比为32,且加速比随着异构处理器数目的增加接近线性增加,表明PETSc核心函数并行算法在神威·太湖之光超级计算机上具有良好的可扩展性.

英文摘要:

Large-scale scientific and engineering calculations such as hydrodynamic calculations, numerical weather forecasting, seismic data processing, genetic engineering, and high-dimensional differential equations are facing with the big performance challenge. Meanwhile, the High Performance Computing (HPC) platform has been significantly developed in recent years. The appearances of multi-core processors and heterogeneous computing platforms dramatically improve the performance of high-performance applications. To fully utilize the computing power of HPC systems, it is necessary to develop specific methodologies to optimize the performance of applications based on the system architecture. The Sunway TaihuLight supercomputer is presently ranked in the TOP500 list as the fastest supercomputer in the world, with a LINPACK benchmark rating of 93 petaflops. The Sunway TaihuLight uses a total of 40960 Chinese designed SW26010 multi-core 64-bit RISC processors. Portable, Extensible Toolkit for Scientific Computation (PETSc), an indispensable module of high performance computing, is one of basic algorithm libraries widely applied in many high-performance applications. Meanwhile, PETSc is also widely used in partial differential equations, sparse linear algebra and other related problems. The performance of PETSc directly affects the efficiency of applications invoking PETSc. In this paper, we use two most typical cases in PETSc according to actual research needs, that is ex5 (solving problems of linear systems on single node) and ex19 (solving problems of 2D driving cavity on multi nodes) to perform them on the Sunway TaihuLight supercomputer. With the analysis of experimental results, we figure out there are seven core functions including vector calculations and matrix calculations. First of all, for each core function, we do an in-depth research of its characteristics, parallel difficulties, optimizations for the bottlenecks. And then, we determine an appropriate heterogeneous parallel model for these functi

同期刊论文项目
期刊论文 1
同项目期刊论文
期刊信息
  • 《计算机学报》
  • 北大核心期刊(2011版)
  • 主管单位:中国科学院
  • 主办单位:中国计算机学会 中国科学院计算技术研究所
  • 主编:孙凝晖
  • 地址:北京中关村科学院南路6号
  • 邮编:100190
  • 邮箱:cjc@ict.ac.cn
  • 电话:010-62620695
  • 国际标准刊号:ISSN:0254-4164
  • 国内统一刊号:ISSN:11-1826/TP
  • 邮发代号:2-833
  • 获奖情况:
  • 中国期刊方阵“双效”期刊
  • 国内外数据库收录:
  • 美国数学评论(网络版),荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊(2004版),中国北大核心期刊(2008版),中国北大核心期刊(2011版),中国北大核心期刊(2014版),中国北大核心期刊(2000版)
  • 被引量:48433