Building on the BLCR software, this paper designs session-level checkpoint save and restore software that operates at the Linux kernel level. The software saves and restores checkpoints synchronously across the processes within a single session, without requiring any change to the dependencies among those processes. Application results show that, once integrated into the Torque/Maui cluster management and scheduling system, the software saves and restores checkpoints of users' running programs transparently.