VDoc+: A Virtual Document Based Approach for Matching Large Ontologies Using MapReduce
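  • ISSN: 1869-1951
  • Journal: Journal of Zhejiang University-Science C(Computers
  • Date: 2012.4.4
  • Pages: 257-267
  • Classification: TP311 [Automation and Computer Technology: Computer Software and Theory; Automation and Computer Technology: Computer Science and Technology]
  • Author affiliation: [1] State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210046
  • Funding: National Natural Science Foundation of China (61003018, 61021062); National Social Science Foundation of China (11AZD121); Specialized Research Fund for the Doctoral Program of Higher Education (20100091120041); Natural Science Foundation of Jiangsu Province (BK2011189)
  • Related project: Methods and techniques for resolving object coreference on the Semantic Web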
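Abstract (translated from the Chinese):

With the development of the Semantic Web, the number of Semantic Web ontologies has grown rapidly. However, the vast majority of data on the World Wide Web is still stored in relational databases. Establishing mappings between relational database schemas and Semantic Web ontologies is an effective way to achieve interoperability between the two. We therefore propose SMap, a semantics-based approach to mapping between relational database schemas and OWL ontologies, which comprises two stages: simple mapping discovery and complex mapping learning. In the simple mapping discovery stage, reverse engineering rules are first used to classify the elements of the relational database schema and the ontology correspondingly into different categories; a virtual document is then built for each element and the similarities between them are computed, with a different virtual document extraction scheme designed for each category of elements. In the complex mapping learning stage, training facts are generated automatically from the simple mappings already discovered together with overlapping database records and ontology instances, and an inductive logic programming algorithm is then applied to learn multiple types of complex mappings based on Horn rules. Experimental results on real-world datasets show that SMap clearly outperforms existing approaches for mapping between relational database schemas and ontologies in both simple mapping discovery and complex mapping learning.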

English abstract:

Ontologies proliferate with the development of the semantic Web. Most data on the Web, however, are still stored in relational databases (RDBs). Creating mappings between RDB schemas and ontologies is an effective way to establish interoperability between them. In this paper, we propose SMap, a semantic approach to creating mappings between RDB schemas and OWL ontologies. SMap consists of two main stages: finding simple mappings and learning complex mappings. In the first stage, reverse engineering rules are applied to classify the elements in an RDB schema and an ontology correspondingly into different categories; virtual documents for the elements are then built according to their categories and matched for similarity. In the second stage, based on the previously found simple mappings as well as some overlapping RDB records and ontology instances, the facts used for inductive logic programming (ILP) are automatically collected; these constitute the background knowledge and positive examples. Then, different types of Horn-rule-like complex mappings are learnt with a bottom-up ILP algorithm. Experimental results on real-world datasets demonstrate that SMap significantly outperforms existing approaches on both simple mapping finding and complex mapping learning, and that such Horn-rule-like mappings have clear semantics and can be used directly for query rewriting.
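
The first stage described above (build a virtual document per element, then match by similarity) can be illustrated with a minimal sketch. This is not SMap's implementation: the abstract does not give the extraction rules, weights, or similarity measure, so the tokenizer, the `neighbor_weight` of 0.5, the cosine measure, the 0.5 threshold, and the sample element names below are all assumptions chosen for illustration. The sketch also omits the category-specific extraction schemes.

```python
import math
import re
from collections import Counter


def tokenize(name):
    """Split an identifier such as 'paper_title' or 'hasAuthor' into lowercase words."""
    spaced = re.sub(r"([a-z0-9])([A-Z])", r"\1 \2", name)
    return [t.lower() for t in re.split(r"[^A-Za-z0-9]+", spaced) if t]


def virtual_document(local_name, neighbor_names, neighbor_weight=0.5):
    """Weighted bag of words: the element's own name plus down-weighted neighbor names.

    The weighting scheme is an illustrative assumption, not taken from the paper.
    """
    doc = Counter()
    for tok in tokenize(local_name):
        doc[tok] += 1.0
    for neighbor in neighbor_names:
        for tok in tokenize(neighbor):
            doc[tok] += neighbor_weight
    return doc


def cosine(doc_a, doc_b):
    """Cosine similarity between two weighted bags of words."""
    dot = sum(w * doc_b.get(t, 0.0) for t, w in doc_a.items())
    norm_a = math.sqrt(sum(w * w for w in doc_a.values()))
    norm_b = math.sqrt(sum(w * w for w in doc_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def match(source_docs, target_docs, threshold=0.5):
    """Map each source element to its most similar target, if above the threshold."""
    mappings = {}
    for s_name, s_doc in source_docs.items():
        best_name, best_score = None, 0.0
        for t_name, t_doc in target_docs.items():
            score = cosine(s_doc, t_doc)
            if score > best_score:
                best_name, best_score = t_name, score
        if best_name is not None and best_score >= threshold:
            mappings[s_name] = best_name
    return mappings


# Hypothetical RDB columns (neighbor = owning table) and
# hypothetical ontology properties (neighbor = domain class).
columns = {
    "paper.title": virtual_document("title", ["paper"]),
    "author.name": virtual_document("name", ["author"]),
}
properties = {
    "hasTitle": virtual_document("hasTitle", ["Paper"]),
    "fullName": virtual_document("fullName", ["Author"]),
}

simple_mappings = match(columns, properties)
print(simple_mappings)
```

Including neighbor names in each document lets a column borrow context from its table, and a property from its domain class, which is the general idea behind virtual documents; a faithful implementation would choose which neighbors to include according to each element's category.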
