东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

一种面向XML数据的SLCA求解算法

ISSN号：1007-791X
期刊名称：燕山大学学报
时间：2013.7.1
页码：339-346-
分类：TP311[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术]
作者机构：[1]燕山大学信息科学与工程学院,河北秦皇岛066004
相关基金：国家自然科学基金资助项目（61272124,61103139）;河北省重点基础研究资助项目（10963527D）
相关项目：基于mRNA结构信息挖掘及多机器学习方法融合的SiRNA设计算法研究

关键词： XML, 关键字查询, 列存储, 哈希, XML, keyword search, column storage, hash

中文摘要：

针对现有方法计算SLCA语义时存在冗余计算问题，提出了一种基于列存储的倒排索引，并结合哈希查找，以自项向下的方式查询处理的算法TDCOL-HS，来避免现有算法“公共祖先重复处理”的问题。算法以最短倒排表作为处理对象，将检测给定结点是否包含其他关键字的操作转化为哈希查找操作，其时间复杂度为O（mxL），最后通过比较各种指标，从不同角度对算法的性能进行了验证．

英文摘要：

Considering that existing methods suffer from redundant computation when processing XML keyword queries for SLCA semantics, we propose an efficient algorithm, namely TDCOL-HS, which processes the given query based on column storage and hash probe operation to avoid the problem of repeatedly computing common ancestor nodes. It takes the shortest list as the working list, and transforms the operation of testing whether a given node contains other keywords into hash probe operations, therefore, achieves the time complexity of O（mxL）. The experimental results demonstrate that the performance benefits of our methods in adding key word search on XML data.

同期刊论文项目