根据实际开发项目经验,分析了H.264视频压缩编码算法的基本原理,如整数DCT变换、多参考帧的运动估、帧内预测、帧间预测;介绍了TMS3206416芯片的内核结构和外围设备,并结合TMS3206416嵌入式系统的特性、程序级代码优化与汇编级优化的方法,应用指令流水线、循环展开、指令并行、数据打包等技术,用汇编语言优化了部分DCT模块。通过实验结果的对比,对H.264中影响编码速度的瓶颈函数取得了较好的优化结果。这些方法和思路具有实际意义,为其他开发视频系统提供了较强的参考价值。
In this paper,analyze the principle of H.264 video compression coding algorithm according to the practical development experience,such as DCT transformation,motion estimation of multiple reference frame,intra-frame prediction and inter-frame prediction;introduce the kernel architecture and peripheral equipment of TMS3206416 chip,and combine the characteristics of TMS3206416 with the code optimization method of program level and assembly level,pipeline,loop unrolling,instruction parallelism and data packaging are used for optimizing part of DCT module.Finally,by comparing the experimental result,know that the bottle-neck function which affecting coding speed was optimized.These methods and thoughts are of great significance and provide good reference value for the development of other video system.