位置:成果数据库 > 期刊 > 期刊详情页
Scalability of 3D deterministic particle transport on the Intel MIC architecture
  • ISSN号:1001-8042
  • 期刊名称:Nuclear Science and Techniques
  • 时间:2015.10
  • 页码:-
  • 分类:TP3-4[自动化与计算机技术—计算机科学与技术] TP393[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology, Changsha 410073, China, [2]Science and Technology on Space Physics Laboratory, Beijing 100076, China
  • 相关基金:Supported by National Natural Science Foundation of China(Nos.61402039,61170083,60970033,61373032 and 91430218);National High Technology Research and Development Program of China(No.2012AA01A301);China Postdoctoral Science Foundation(No.2014M562570);National Key Basic Research Program of China(No.61312701001)
  • 相关项目:基于64位RISC、共享前端SIMD多核架构的同构通用流处理器体系结构
中文摘要:

The key to large-scale parallel solutions of deterministic particle transport problem is single-node computation performance. Hence, single-node computation is often parallelized on multi-core or many-core computer architectures. However, the number of on-chip cores grows quickly with the scale-down of feature size in semiconductor technology. In this paper, we present a scalability investigation of one energy group time-independent deterministic discrete ordinates neutron transport in 3D Cartesian geometry(Sweep3D) on Intel’s Many Integrated Core(MIC) architecture, which can provide up to 62 cores with four hardware threads per core now and will own up to 72 in the future. The parallel programming model, Open MP, and vector intrinsic functions are used to exploit thread parallelism and vector parallelism for the discrete ordinates method, respectively. The results on a 57-core MIC coprocessor show that the implementation of Sweep3 D on MIC has good scalability in performance. In addition, the application of the Roofline model to assess the implementation and performance comparison between MIC and Tesla K20 C Graphics Processing Unit(GPU) are also reported.

英文摘要:

The key to large-scale parallel solutions of deterministic particle transport problem is single-node computation performance. Hence, single-node computation is often parallelized on multi-core or many-core computer architectures. However, the number of on-chip cores grows quickly with the scale-down of feature size in semiconductor technology. In this paper, we present a scalability investigation of one energy group time-independent deterministic discrete ordinates neutron transport in 3D Cartesian geometry(Sweep3D) on Intel’s Many Integrated Core(MIC) architecture, which can provide up to 62 cores with four hardware threads per core now and will own up to 72 in the future. The parallel programming model, Open MP, and vector intrinsic functions are used to exploit thread parallelism and vector parallelism for the discrete ordinates method, respectively. The results on a 57-core MIC coprocessor show that the implementation of Sweep3 D on MIC has good scalability in performance. In addition, the application of the Roofline model to assess the implementation and performance comparison between MIC and Tesla K20 C Graphics Processing Unit(GPU) are also reported.

同期刊论文项目
期刊论文 26 会议论文 11 专利 3
同项目期刊论文
期刊信息
  • 《核技术:英文版》
  • 主管单位:中国科学院
  • 主办单位:中国科学院上海应用物理研究所 中国核学会
  • 主编:马余刚
  • 地址:上海市800-204信箱
  • 邮编:201800
  • 邮箱:nst@sinap.ac.cn
  • 电话:021-39194048
  • 国际标准刊号:ISSN:1001-8042
  • 国内统一刊号:ISSN:31-1559/TL
  • 邮发代号:4-647
  • 获奖情况:
  • 1996年获中科院优秀期刊三等奖
  • 国内外数据库收录:
  • 俄罗斯文摘杂志,美国化学文摘(网络版),美国科学引文索引(扩展库),英国科学文摘数据库,英国英国皇家化学学会文摘
  • 被引量:57