Dongli Research Big Data Discovery System (DRDS)


Understanding co-run performance on CPU-GPU integrated processors: observations, insights, directions

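ISSN: 1000-9825
Journal: 《软件学报》 (Journal of Software)
Date: 0
Classification: TP316 [Automation and Computer Technology: Computer Software and Theory; Automation and Computer Technology: Computer Science and Technology] U464 [Mechanical Engineering: Vehicle Engineering; Transportation Engineering: Vehicle Operation Engineering; Transportation Engineering: Road and Railway Engineering]
Author affiliations: [1] State Key Laboratory of High Performance Computing, Changsha 410073, China; [2] School of Computer, National University of Defense Technology, Changsha 410073, China
Funding: This work was supported by the National High Technology Research and Development 863 Program of China under Grant No. 2012AA010905 and the National Natural Science Foundation of China under Grant Nos. 61272143 and 61472431.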

Authors: Qi ZHU[1,2], Bo WU[3], Xipeng SHEN[2], Kai SHEN[4], Li SHEN[1], Zhiying WANG[1]

Keywords: loop characteristics, optimization strategy, thread, software, system, mid-surface (中面), parallel model, optimization techniques, parallel programming model, optimization, thread level speculation, HEUSPEC, performance

Abstract:

Thread-level speculation provides not only a simple parallel programming model, but also an effective mechanism for exploiting thread-level parallelism. The performance of software speculative parallel models is limited by high global overheads caused by different types of loops. These loops usually have different dependency characteristics and call for different optimization strategies. In this paper, we propose three comprehensive optimization techniques that reduce different factors of the global overhead, each targeting the requirements of a different type of loop. Inter-thread fetching can reduce the high mis-speculation rate of loops with frequent dependencies, and out-of-order committing can reduce the control overhead of loops with infrequent dependencies, while enhanced dynamic task granularity resizing can reduce the control overhead and optimize the global overhead of loops whose dependency characteristics change. All three optimization techniques have been implemented in HEUSPEC, a software TLS system. Experimental results indicate that they can satisfy the demands of different groups of benchmarks. The combination of these techniques can improve the performance of all benchmarks and reach a higher average speedup.

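Projects with papers in the same journal

Research on Efficient Thread-Level Speculative Execution Mechanisms on Multi-Core Platforms

Journal papers: 2

Research on Key Technologies for Energy-Efficiency Optimization of Heterogeneous Multi-Core Architectures

Journal papers: 2

Research on Key Technologies for the Design of High-Efficiency Many-Core Asynchronous Microprocessors

Journal papers: 2

Journal papers from the same project

A Power Consumption Estimation Model for Multi-Core Processors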

Exploiting Parallelism in the Simulation of General Purpose Graphics Processing Unit Program


Journal information

《软件学报》 (Journal of Software)
Peking University Core Journal (2011 edition)

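Supervising body: Chinese Academy of Sciences
Sponsors: Institute of Software, Chinese Academy of Sciences; China Computer Federation
Editor-in-chief: Zhao Chen (赵琛)
Address: P.O. Box 8718, Beijing, Institute of Software, Chinese Academy of Sciences
Postal code: 100190
Email: jos@iscas.ac.cn
Phone: 010-62562563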

International standard serial number: ISSN 1000-9825
Domestic unified serial number: CN 11-2560/TP
Postal distribution code: 82-367

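Awards:
Selected in 2001 as a "Double Hundred Journal" (双百期刊) of the China Periodical Phalanx (中国期刊方阵); awarded First Prize for Outstanding Scientific and Technical Journals of the Chinese Academy of Sciences in 2000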

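Indexed in domestic and international databases:
Abstract Journal (Russia), Mathematical Reviews, online edition (USA), Index Copernicus (Poland), Zentralblatt MATH (Germany), Scopus abstract and citation database (Netherlands), Engineering Index (USA), Cambridge Scientific Abstracts (USA), Science Abstracts database (UK), Japan Science and Technology Agency database (Japan), China Science and Technology Core Journals (China), Peking University Core Journals, 2004 edition (China), Peking University Core Journals, 2008 edition (China), Peking University Core Journals, 2011 edition (China), Peking University Core Journals, 2014 edition (China), Peking University Core Journals, 2000 edition (China)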

Citations: 54609