东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

VDoc+: A Virtual Document Based Approach for Matching Large Ontologies Using MapReduce

ISSN号：1869-1951
期刊名称：Journal of Zhejiang University-Science C(Computers
时间：2012.4.4
页码：257-267
分类：TP311[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术]
作者机构：[1]计算机软件新技术国家重点实验室(南京大学),南京210046
相关基金：国家自然科学基金项目（61003018,61021062）;国家社会科学基金项目（11AZD121）;高等学校博士学科点专项科研基金项目（20100091120041）;江苏省自然科学基金项目（BK2011189）
相关项目：语义Web中对象共指的消解方法与技术

关键词：本体映射, 模式匹配, 关系数据库, 虚拟文档, 归纳逻辑编程, ontology mapping, schema matching, relational database, virtual document, inductivelogic programming

中文摘要：

伴随语义网的发展，语义网本体数量激增．然而万维网上绝大多数的数据仍存储在关系数据库中．建立关系数据库模式与语义网本体间的映射是一种实现两者之间互操作性的有效途径．因此，提出了一种基于语义的关系数据库模式与OWL本体间的映射方法SMap，包含简单映射发现和复杂映射学习两个阶段．在简单映射发现阶段，首先通过逆向工程规则将关系数据库模式和本体中的元素对应地分为不同类别，再为每个元素构建虚拟文档并计算它们之间的相似度，其中针对不同类别的元素设计了不同的虚拟文档抽取方案．在复杂映射学习阶段，基于已发现的简单映射以及重叠的数据库记录和本体实例，自动化地生成训练事实数据，然后运用归纳逻辑编程算法学习出多种类型的基于Horn规则的复杂映射．真实数据集上的实验结果表明，SMap在简单映射发现和复杂映射学．-j上均明显优于现有的关系数据库模式与本体间映射方法．

英文摘要：

Ontologies proliferate with the development of the semantic Web. Most data on the Web, however, are still stored in relational databases （RDBs）. Creating mappings between RDB schemas and ontologies is an effective way for establishing the interoperability between them. In this paper, we propose SMap, a semantic approach to create mappings between RDB schemas and OWL ontologies. SMap consists of two main stages： finding simple mappings and learning complex mappings. In the first stage, reverse engineering rules are applied to classify the elements in an RDB schema and an ontology correspondingly into different categories, and the virtual documents for the elements are built in terms of their categories and then matched for similarities. In the second stage, based upon the pre-found simple mappings as well as some overlapped RDB records and ontology instances, the facts used for inductive logic programming （ILP） are automatically collected, which constitute the background knowledge and positive examples. Then, different types of Horn-rule-like complex mappings are learnt with a bottom-up ILP algorithm. Experimental results on real-world datasets demonstrate that, SMap outperforms existing approaches significantly on both simple mapping finding and complex mapping learning, and such Horn-rule-like mappings are of clear semantics and can be directly used for query rewriting.

同期刊论文项目

语义Web中对象共指的消解方法与技术

期刊论文 8 会议论文 11 专利 1

语义网应用技术体系和发展战略研究

期刊论文 6

同项目期刊论文

SMap: 基于语义的关系数据库模式与OWL本体间映射方法

语义Web课程建设初探

语义Web中对象共指的消解研究

基于MapReduce的对象共指消解方法

语义Web课程建设初探

基于特征选择和点互信息剪枝的产品属性提取方法

语义Web中对象共指的消解研究

一种基于HBase的RDF数据存储模型

基于无指导学习的微博评论分析方法