为了同时做到应对千万亿次高性能计算的技术挑战和满足数据中心(datacenter)未来的主要应用模式效用计算(utility computing)的需求,提出了一种称为HPP(HyperParallelProcessing)的高性能计算机体系结构。HPP的主要特征是全局地址空间(globaladdressspace)和单一操作系统映像的超节点(hypernode)。HPP结合了MPP的可扩展性,DSM的高效通信和机群的普及化的优点,为高性能计算和效用计算都提供了许多创新研究的机会。基于HPP体系结构,实现了一个曙光5000高性能计算机的原型系统,初步验证了它的可行性。
An architecture of high performance computer, called Hyper Parallel Processing (HPP), is proposed to satisfy the requirements of both High Performance Computing, and Utility Computing which will be the application model of data centers. HPP combines the benefits of the scalability of MPP, the communication efficiency of DSM, as well as the commodity of cluster. Comparisons of current main-stream high performance computer architectures show that none of them can satisfy both HPC and utility computing. The main features of HPP are Global Address Space(GAS) and Hyper Node with single Op- erating System image. HPP supports the distributed global address space including both memory and I/O, but without hardware cache coherence. A Hyper Node consists of a set of application specific CPUs and a (or more) OS specific CPU. The OS CPU maintains the single system image, while the application CPUs run only lightweight run-time software. Besides the GAS interconnect network for applications, a standard SAN connects all OS CPUs and I/O devices providing system management and storage service. HPP is able to provide many opportunities of innovative research in High Performance and Utility Computing areas, including communication, synchronization, programming model, node operating system, utility computing, fault isolation, CPU and system etc. According to HPP architecture, a prototype system of Dawning5000 HPC is implemented and the feasibility of HPP is proved.