索引的一个新方法;在 XML 处理枝条模式记录在这篇论文被建议。在 XML 文件的每条路径能被转变成标签的一个序列由编码结构那构造在 XML 之间的一对一的通讯树;顺序。在 XML 树上识别节点的特征上的底,元素被分类;聚类。在询问继续期间,枝条模式也被转变成它的编码结构。由执行随后在 XML 文件在序列的集合上匹配,在 XML 文件的路径的所有出现被精制。用这个索引,检索的元素的数字被最小化。没有任何假打发或虚惊,有恰当的格式的搜索结果提供更多的结构信息。这个索引也支持关键词搜索。实验结果显示这个索引显著地与高精确有效率。
A new way of indexing and processing twig patterns in an XML documents is proposed in this paper. Every path in XML document can be transformed into a sequence of labels by Structure-Encoded that constructs a one-to-one correspondence between XML tree and sequence. Base on identifying characteristics of nodes in XML tree, the elements are classified and clustered. During query proceeding, the twig pattern is also transformed into its Structure-Encoded. By performing subsequence matching on the set of sequences in XML documents, all the occurrences of path in the XML documents are refined. Using the index, the numbers of elements retrieved are minimized. The search results with pertinent format provide more structure information without any false dismissals or false alarms. The index also supports keyword search Experiment results indicate the index has significantly efficiency with high precision.