Schema matching and entity resolution are two widely studied topics in database research. With the growing demand for Web data integration at both the schema level and the instance level, studying these two problems is of real practical importance. Most current work addresses only one of the two tasks. In this paper, building on the idea that schema matching and entity resolution can benefit from each other, we propose a method that tackles both tasks simultaneously and establishes a mechanism of mutual reinforcement between them. Applied to a realistic scenario of heterogeneous Web data sources, the method achieves high precision and high recall, demonstrating its correctness and effectiveness.