位置:成果数据库 > 期刊 > 期刊详情页
基于格空间的受限Deep Web数据抽取算法
  • 期刊名称:模式识别与人工智能
  • 时间:0
  • 页码:130-137
  • 分类:N[自然科学总论]
  • 相关基金:Supported by the National Natural Science Foundation of China (60970018)
  • 相关项目:Web社区用户个性挖掘与排序研究
中文摘要:

In this paper,we present a novel approach utilizing attributes correlation for the sampling task on nonuniform hidden databases. We propose the method of calculating the attributes dependency and construct the sampling template according to the attributes dependency. Then,we use the sampling template to gen-erate initial sampling queries and propose a bottom-up algorithm to search the sampling template. We also conduct extensive ex-periments over real deep Web sites and controlled databases to illustrate that our sampling method has good performance both on the quality and efficiency.

英文摘要:

In this paper,we present a novel approach utilizing attributes correlation for the sampling task on nonuniform hidden databases. We propose the method of calculating the attributes dependency and construct the sampling template according to the attributes dependency. Then,we use the sampling template to gen-erate initial sampling queries and propose a bottom-up algorithm to search the sampling template. We also conduct extensive ex-periments over real deep Web sites and controlled databases to illustrate that our sampling method has good performance both on the quality and efficiency.

同期刊论文项目
期刊论文 15 会议论文 6 专利 3
同项目期刊论文