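Elementary functions in scientific applications come in many types, are complex to implement, and are used infrequently. To address these characteristics, this paper proposes a quadruple-precision floating-point elementary function co-processor with a customized VLIW structure (QPC-Processor). The structure uses explicitly parallel techniques to exploit the parallelism in the algorithms that implement elementary functions, and it computes multiple elementary functions on the same hardware platform through different combinations of basic operations (meta-operations). The paper also proposes an algorithm that maps sequences of elementary-function basic operations onto customized VLIW instructions, which guides the design of the elementary functions. Finally, the design is verified on an FPGA platform. Experimental results show that, relative to a software implementation, a single QPC-Processor achieves a speedup of more than 6x. Moreover, the QPC-Processor implements multiple types of algorithms on the same hardware platform, which compensates for the shortcomings of any single algorithm and yields high hardware resource utilization.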
This paper proposes a quadruple-precision floating-point elementary function co-processor based on a very long instruction word (VLIW) structure (QPC-Processor), which exploits parallelism through the explicitly parallel technology of the VLIW structure. A variety of quadruple-precision elementary functions are evaluated on the unified hardware through different combinations of basic operations. Finally, we prototype the QPC-Processor units on an FPGA chip. The experimental results show that our design outperforms the software approach by a factor of more than 6. Moreover, the QPC-Processor achieves high utilization of hardware resources.