位置:成果数据库 > 期刊 > 期刊详情页
基于简化Trace的动态隐式断言执行
  • 期刊名称:计算机学报,2007,30(11):1972-1981.
  • 时间:0
  • 分类:TP302[自动化与计算机技术—计算机系统结构;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]国防科技大学计算机学院分布与并行处理国家重点实验室,长沙410073
  • 相关基金:本课题得到国家自然科学基金项目“高效能并行计算机体藤结构研究”(60633050)资助.致谢 同组的曹宏嘉博士建立了动态优化测试平台,王晓东副教授对实验的选择和论文的撰写提出了许多宝贵的建议,在此表示感谢!
  • 相关项目:高效能并行计算机体系结构研究
中文摘要:

分支指令与分支预测失败限制了处理器发掘指令级并行(ILP)的潜力.通过If—conversion或Predicated执行将程序中的控制相关转化为数据相关,能较好地降低分支预测开销.提出一种基于简化Trace结构的动态隐式断言执行机制(Dynamic Implicit Predication,DIP),而早期的相关研究主要集中于由编译器显式为宽发射处理器产生静态Predicated指令.无需编译器或者其他二进制工具的帮助,DIP可以在程序运行过程中识别可以进行断言变换的指令片断,完成指令转换与优化,并在以后的执行中使用优化后的指令Trace.基于SPEC2000模拟测试表明DIP可以有效避免错误的分支预测,提高并行度,单个程序的IPC平均提高10.3%,基准程序的平均加速比可达7.59%.

英文摘要:

To exploit instruction level parallelism, modern microprocessor usually converts control dependences into data dependences. If-conversion and predicated execution are widely adopted to eliminate branch misprediction penalty. In this paper, a trace-based predicate mechanism named DIP(Dynamic Implicit Predication) is discussed. Previous predication execution depends on compiler to generate explicit predicated instructions. The candidates of if-conversion will be identified during dynamic execution. Classical trace cache has been modified to store DIP traces, which include instructions both from fall-through and target block behind the conditional branch. Hardware will add predication to DIP trace automatically. With the help of DIP, legacy applications can benefit from predication mechanism without recompiling source code. Simulation of DIP under various hardware configurations is presented in the paper. Results have shown promising performance improvement. For SPEC INT2000 benchmark, average IPC (Instruction Per Cycle) improvement achieves 10.3%, and average speedup of execution time is 7.59%.

同期刊论文项目
期刊论文 72 会议论文 39 获奖 3
同项目期刊论文