A Video Semantic Description Method for Chinese Sign Language Synthesis

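ISSN: 0254-0037
Journal: Journal of Beijing University of Technology (北京工业大学学报)
Year: 2012
Pages: 730-735
Classification: TP391 [Automation and Computer Technology: Computer Application Technology; Automation and Computer Technology: Computer Science and Technology]
Author affiliation: [1] Beijing Municipal Key Laboratory of Multimedia and Intelligent Software Technology, College of Computer Science, Beijing University of Technology, Beijing 100124
Funding: National Natural Science Foundation of China (61170104); Beijing Natural Science Foundation (4102009).
Acknowledgments: The authors thank Beijing No. 3 School for the Deaf for its extensive help with recording the sign language videos.
Related project: Research on Chinese sign language synthesis based on multi-granularity video units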

Keywords: sign language synthesis, semantic description, semantic model

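Abstract (translated from the Chinese):

To make synthesized sign language video more realistic, a video semantic description method for sign language synthesis is proposed, and a corresponding video database is built on the semantic descriptions. Sign language video is collected for a specific research domain, and the source video is segmented by word meaning into lexical units and multi-level transition units organized by body and body part. A multi-dimensional semantic model is built for each video unit by semantically describing every frame. Each unit's model represents the concrete sign language information contained in every frame of that video, including location, hand shape, and rhythm. During synthesis, the useful information can be retrieved in real time by parsing a video's multi-dimensional semantic model. The method gives sign language synthesis a real-time, consistent semantic understanding. In addition, when two sign language videos with different rhythms are spliced, the parsed rhythm information can be used to adjust the interpolation positions of the transition frames appropriately, producing a transition video with consistent rhythm.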

English abstract:

To improve the realism of synthesized sign language videos, a method for describing sign language video semantics is proposed, and a sign language video database based on semantic description is constructed for sign language synthesis. Chinese sign language videos in a specific research field are captured, and sign language video units and multi-dimensional transition units are then cut from the captured videos. By describing the semantic information of every frame in the sign language videos, which includes locations, hand shapes, and rhythm information, their multi-dimensional semantic models are constructed. During sign language video synthesis, useful information can be retrieved in real time by parsing the multi-dimensional semantic models. This method provides real-time and coherent semantic information for sign language video synthesis. When two sign language videos are joined, their different rhythm information can be parsed from their semantic models, and the interpolated locations of the transition frames can then be adjusted moderately so that the rhythm changes gradually across the transition frames.
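
The rhythm-aware transition described in the abstract can be illustrated with a small sketch. This is not the paper's implementation: the abstract does not give the data format or the interpolation formula, so everything below is an assumption made for illustration. In particular, `FrameSemantics`, `VideoUnit`, and `transition_locations` are hypothetical names, rhythm is assumed to be a hand speed per frame, and the transition is assumed to ramp that speed linearly from the end of one unit to the start of the next.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class FrameSemantics:
    """Hypothetical per-frame semantic record (location, hand shape, rhythm)."""
    location: Tuple[float, float]  # assumed: 2D hand position in the image
    hand_shape: str                # assumed: a hand-shape label
    rhythm: float                  # assumed: hand speed, in pixels per frame


@dataclass
class VideoUnit:
    """Hypothetical video unit: a name plus one semantic record per frame."""
    name: str
    frames: List[FrameSemantics]


def transition_locations(prev: VideoUnit, nxt: VideoUnit,
                         n_frames: int) -> List[Tuple[float, float]]:
    """Place n_frames hand locations between the last frame of `prev`
    and the first frame of `nxt`.

    The step length ramps linearly from prev's rhythm to nxt's rhythm,
    so the spacing of the interpolated locations changes gradually
    instead of being uniform.
    """
    start, end = prev.frames[-1], nxt.frames[0]
    (x0, y0), (x1, y1) = start.location, end.location
    steps = n_frames + 1  # n_frames interior points split the path into n_frames + 1 steps
    # Speed assigned to each step, sampled at the step midpoint of a linear ramp.
    speeds = [start.rhythm + (end.rhythm - start.rhythm) * (i + 0.5) / steps
              for i in range(steps)]
    total = sum(speeds)
    if total <= 0:
        # Fall back to uniform spacing when no usable rhythm is available.
        speeds, total = [1.0] * steps, float(steps)
    locations = []
    travelled = 0.0
    for speed in speeds[:-1]:
        travelled += speed
        t = travelled / total  # fraction of the path covered so far
        locations.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return locations


# Usage: joining a slow unit to a faster one shifts the single
# transition frame toward the slow side of the path.
slow = VideoUnit("word_a", [FrameSemantics((0.0, 0.0), "flat", 1.0)])
fast = VideoUnit("word_b", [FrameSemantics((10.0, 0.0), "fist", 3.0)])
print(transition_locations(slow, fast, 1))
```

With equal rhythms on both sides the sketch reduces to plain linear interpolation, which is the baseline that the rhythm adjustment modifies.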

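Projects with papers in this journal

Research on image/video coding and decoding based on compressed sensing theory

Journal papers: 3

Research on Chinese sign language synthesis based on multi-granularity video units

Journal papers: 9; conference papers: 6

Journal papers from the same project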

Synthesis of sign language co-articulation based on key frames

Chinese Sign Language Animation Generation Considering Context

Capture Surface Light Field for Gesture with Sparse Multi-view Videos

Adaptive particle shape setting and normal calculation methods in fluid rendering

High-resolution Light Field Capture with Coded Aperture

Image decoding method based on a sparse representation model

Two-dimensional face recognition fusing multi-channel information

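Journal information

Journal of Beijing University of Technology (《北京工业大学学报》)
China Science and Technology Core Journal

Supervising body: Beijing Municipal Education Commission
Sponsor: Beijing University of Technology
Editor-in-chief: Lu Zhenyang (卢振洋)
Address: 100 Pingleyuan, Chaoyang District, Beijing
Postal code: 100124
Email: xuebao@bjut.edu.cn
Telephone: 010-67392535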

International standard serial number: ISSN 0254-0037
Domestic serial number: CN 11-2286/T
Postal distribution code: 2-86

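Awards:
Second Prize, Outstanding Natural Science Journals of Chinese Universities; Beijing Outstanding Journal; Outstanding Journal of the Five Provinces and Municipalities of North China; "Double Effect" journal of the China Periodical Matrix (中国期刊方阵)

Indexed in domestic and international databases:
Abstracts Journal (Russia); Chemical Abstracts, online edition (USA); Mathematical Reviews, online edition (USA); Zentralblatt MATH (Germany); Scopus abstract and citation database (Netherlands); Cambridge Scientific Abstracts (USA); Science Abstracts database (UK); Japan Science and Technology Agency database (Japan); China Science and Technology Core Journals (China); Peking University Core Journals (China; 2000, 2004, 2008, 2011, and 2014 editions)

Citations: 11924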