为加快表面三维形貌分析中高斯滤波算法的执行速度,提出了一种基于计算统一设备构架(CUDA)的高斯滤波算法来实现高速并行处理.分析高斯滤波算法原理和CUDA并行计算体系,将CUDA并行计算技术引入到表面分析领域.针对高斯滤波数据间依赖性弱和CUDA采用单指令多线程(SIMT)执行模型的特点,总结出适合于cUDA的并行高斯滤波算法流程.实验证明:该方法与CPU串行处理方法相比,其加速比达到40倍以上,可以有效提高数据处理能力.
In order to speed up the Gaussian filtering in three-dimensional surface texture analysis, an efficient method based on compute unified device architecture (CUDA) implemented on graphic pro- cessing unit (GPU) was designed. CUDA parallel computing technology was introduced into the sur- face texture analysis area by analyzing principle of Gaussian filtering and CUDA computing architec- ture. Parallel Gaussian filtering algorithm used in CUDA was given for the characteristics of low de- pendency of Gaussian filtering and SIMT (single instruction multiple thread) execution model of CUDA. Experiments prove that the calculation speed based on CUDA is 40 times faster than that of traditional sequence algorithm based on CPU, can effectively improve data processing capability.