随着多核处理器体系结构在高性能计算领域日益广泛的应用,面向共享存储并行程序的容错问题成为研究的热点.近年来,检查点技术已经成为该领域占主导地位的容错机制.目前已有一些针对OpenMP程序检查点技术的研究工作,但其中绝大多数解决方案都依赖于特殊的运行时库或硬件平台.该文提出一种编译辅助的OpenMP应用级检查点,它是一种平台无关的方案,通过面向OpenMP的扩展数据流分析选择那些"必需"的变量保存到检查点映像,从而降低容错的开销,同时通过运行一种非阻塞式的协议维护检查点的全局一致性.文章讨论了该机制的各个关键问题,并通过实验评测以及与同类工作的比较,表明了该文所提出的检查点机制在容错性能方面的优势.
formance research. Recently, posed. H paper proposes a compiler-assisted application level cheekpointing for OpenMP programs. It is a platform-independent scheme, and through the extended static data flow analysis, it automatical- ly chooses those 'must-be-saved' variables to save in the checkpoint image, to reduce the over- head. It also maintains the global coherence of checkpoints by running a non-block protocol. In this paper, the key issues in the approach are discussed in detail, and the experimental result and the comparison with similar works show the proposed approach achieves promising performance.