针对嵌入式纹理映射过程中处理速度慢和存储带宽对系统性能制约等问题,提出一种专用纹理映射单元体系结构,对单精度浮点除法器和纹理Cache进行优化设计。采用较小查找表结合二次多项式逼近算法实现浮点除法运算,根据纹理采样的不同方式,对纹理Cache的映射方式在直接映射、两路组关联和四路组关联之间进行动态配置。对纹理映射单元在SMIC0.13ktmCMOS工艺下进行仿真,结果表明,纹理映射模块工作主频可达313MHz,对于480x640像素,帧率可达51f/s,能够满足移动设备对实时渲染的需求。
This paper presents a dedicated architecture of texture mapping unit with an optimized design of single-precision floating-point divider and texture Cache, against the processing speed and memory bandwidth problem. It uses a small look-up table with quadratic polynomial approximation algorithm to achieve floating-point division. According to different sampling methods, it configures Cache mapping strategy among direct mapping, two-way set associative mapping and four-way set associative mapping. The design of the texture mapping unit is synthesized under SMIC0.13 pm CMOS process technology. Experimental results show that the design can achieve frame rate up to 51 f/s for 480x640 resolution at frequency up to 313 MHz, it is efficient for real time rendering for mobile devices.