位置:成果数据库 > 期刊 > 期刊详情页
有效的XML模糊内容与结构检索和计分
  • 期刊名称:计算机研究与发展
  • 时间:0
  • 页码:1070-1078
  • 语言:中文
  • 分类:TP311.13[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]江西财经大学信息管理学院,南昌330013, [2]江西省高校数据与知识工程重点实验室,南昌330013
  • 相关基金:国家自然科学基金项目(60763001 60803105/F020606); 国家社会科学基金项目(07BTQ025); 江西省自然科学基金项目(2007GZS0082); 江西省教育厅科技重点项目(GJJ08506 GJJ08507)
  • 相关项目:面向查询的XML文本自动文摘研究
中文摘要:

XML文档包含有内容和结构,除了可以进行纯内容(CO)检索外,还可以进行内容和结构(CAS)检索.提出了一种新的CAS检索方法,这种方法以内容检索为主,结构匹配为辅,结构约束主要影响结点的计分,而不是答案结点的选择.这种方法分3步进行:首先,一个CAS查询被分解为若干个查询片段;然后处理每个查询片段;最后,将每个查询片段得到的部分查询结果综合起来,得到最终的查询结果.提出了一种新的计分方案,它首先计算一个查询结果在每个查询片段上的得分,然后将这些得分总和起来得到最终得分.提出的计分方法根据检索结果内容和结构两方面的相关性计分,更符合用户查询意图和查询语义.大量的实验结果验证了提出方法的有效性.

英文摘要:

XML documents involve both contents and structures,and can be retrieved by means of not only content-only (CO) but also content-and-structure (CAS) queries. In this paper,a novel approach for CAS retrieval is proposed. The approach proceeds in three steps: it first decomposes a CAS query into a set of query fragments,and then processes each query fragment. Finally,it combines results on each query fragments. By this approach,on the one hand,the adverse effects of structural vagueness on answer nodes selection can be removed; on the other hand,the effect of structural constraints on scoring is incorporated properly. The features of this approach make it applicable in versatile homogeneous and heterogeneous data environments. To measure the relevance query results to a given CAS query,a novel scoring scheme is presented. In accordance with the query processing approach,the scoring method first computes the scores of a query result with respect to each query fragment,and then combines these partial scores to arrive at an overall score. The proposed scoring method considers the relevance of both contents and structures in the retrieval results,and thus reflects the user's query intention and conforms to query semantics. Comprehensive experimental studies demonstrate the effectiveness of the proposed methods.

同期刊论文项目
期刊论文 21 会议论文 15
期刊论文 33 会议论文 14 获奖 2 著作 1
同项目期刊论文