提出了一个并行矩阵乘算法IPBPMM(Interconnected Processor-Based Parallel Matrix Multiplication) 该算法运行在以五角形、Petersen图和Hoffman-Singleton图等直径为2的摩尔图(满足n=d^2+1,n为节点数,d为度)为拓扑结构的由n个独立处理器构成的机群并行计算环境中.与基于二维环绕网孔阵列拓扑结构的Cannon和Fox等并行矩阵乘法算法相比较,IPBPMM算法通信开销较小,加速比更高,同时还具有矩阵分块可随机分布在各个节点中,无需事先按一定规律装入各节点中的特点.同时IPBPMM算法也能很好地扩充到由多个直径为2的摩尔图为拓扑结构组合构成的并行计算环境中,且随着网络的扩大,算法的并行加速比更高.
A parallel matrix multiplication algorithm called IPBPMM (interconnected processorbased parallel matrix multiplication) algorithm is presented. The algorithm runs on parallel computing environment consisting of clusters of n independent processors connected using topology of Moore graph of diameter 2(satisfy n=d^2+1, n is number of nodes, d is degree), such as Pentagon, Petersen and Hoffman-Singleton graph. Compared with Canon and Fox parallel matrix multiplication algorithm which are based on the 2-D mesh ring interconnection network, the commu- nication cost is low and the divided sub-matrix can be randomly distributed among the processors in IPBPMM algorithm, without the need to load the sub-matrices into processors according to a specific rule. In addition, IPBPMM algorithm can be well scaled onto larger parallel computing networks with topological structure composed by combining multiple basic networks of Moore graph of diameter 2, and as network size grows, the parallel speed up of IPBPMM algorithm becomes even high.