东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于压缩域的多路视频流混合方法

ISSN号：1002-0470
期刊名称：高技术通讯
时间：2012
页码：42-47
分类：TN948.4[电子电信—信号与信息处理;电子电信—信息与通信工程]
作者机构：[1]武汉大学国家多媒体软件工程技术研究中心,武汉430072
相关基金：973计划（2009CB320906）,国家重大科技专项（2010ZX03004-003-03）和国家自然科学基金（61003184,60970160,61070080）资助项目.
相关项目：基于时域选择特性和分级掩蔽的视频感知编码研究

作者： Han Zhen|胡瑞敏|常军|钟睿|韩镇|Wang Zhongyuan|Hu Ruimin|Chang Jun|Zhong Rui|

关键词：视频混合, 压缩域, 转码, 视频会议, video composition, DCT domain, transcoding, video conference

中文摘要：

针对视频会议应用中传统的像素域多路视频混合方法存在运算复杂度高、画面质量损伤的问题，提出了一种基于压缩域的替代方法，并给出了详细的码流映射算法步骤。该方法按照混合后多画面的空间位置关系，通过对输入的多路码流中宏块编码次序的重排和语法元素的映射，在码流级别将多路视频合成到同一画面中，并采取提前量化策略消除可能出现的二次量化失真，从而可兼具处理速度快和高保真的双重优点。以H．263为例验证了此方法的有效性。实验结果表明，与编解码器级联的方法相比，此方法的峰值信噪比（PSNR）平均提高2dB，运算效率提高百倍以上。此研究工作有望为正在制定的国际视频编码标准H．265贡献一种视频混合解决方案。

英文摘要：

Aiming at the problems of high computational complexity and picture quality degradation of traditional pixel domain video mixing methods useful for multipoint conferences, this paper proposes a multipoint video composition scheme based on the compressed domain of discrete cosine transform （ DCT）, and describes the details of the bitstream mapping algorithm. With the rearrangement of the macroblock coding order and the mapping of the syntax element, the scheme combines multiple channel video frames together into the unique picture on the syntax layer according to the spatial position relation of the composition stream, and then a pre-quantization policy is particularly presented to remove requantization errors. To verify the availability, the details of the algorithm are integrated into the H. 263 codec. The experimental results revealed that compared with the cascaded method, the average peak signal to noise ratio of the proposed method （PSNR） was improved almost 2dB and the operational efficiency increased a hundredfold. It is possible that this research can provide a video mixing solution for the international video coding standard H. 265 which is under development.

同期刊论文项目