随着信息技术的飞速发展,现实世界中涌现出大量的跨媒体数据.所谓跨媒体数据是指那些表达的内容相似,但以不同模态、不同来源、不同背景等形式出现的数据.比如,一张描述花豹的网页通常采用共生的图片和文本等不同的模态刻画花豹的外形和习性.这些跨媒体数据通常呈现出底层特征异构、高层语义相关的特性.传统的单媒体学习方法已无法适应跨媒体数据呈现出的特征异构性.因此,跨媒体学习相关理论与方法的研究是当前数字媒体分析领域的热点研究课题之一.该文主要介绍了跨媒体学习的研究背景和应用价值,概括介绍了各类跨媒体学习相关方法的数学原理和基本特性,并重点介绍了跨媒体共享子空间学习的研究进展,比较了基于投影、矩阵分解、任务和度量等四大类子空间学习方法的优缺点,分析了未来的发展方向.
With the rapid development of information technology, there are many CMD (Cross- Media Data) in the real world. The so-called cross-media data refer to information items with similar underlying contents, which arrive in different modalities, sources or backgrounds, and so on. For example, a webpage describing leopards uses co-occurring text and image of different modalities to represent leopards. These CMD show characteristics of the heterogeneity of low- level features and the correlation of high-level semantics. Traditional Mono-Media Learning (MML) methods have not been able to adapt to the feature heterogeneity of CMD. Thus, the research on related problems in CML (Cross-Media Learning) have been One of the hot research topics in the field of digital media analysis recently. This paper mainly introduces the research background and application value of CML, and provides an overview of mathematical principle and specialties of various related methods in CML. Meanwhile, the advances in cross-media shared subspace learning are presented. Furthermore, the paper compares the advantages and disadvantages of four kinds of subspace learning task and measurement, respectively. Finally. methods based on projection, matrix decomposition, the future development of CMI. is analyzed.