针对当前XML文档信息查询算法的不足,提出一种基于有效路径权重的树匹配算法。在保持XML文档树有效结点和树结构的基础上,树根结点信息最重要,随着树深度增加,结点信息重要性逐渐减弱的特点,按照路径层次自动计算路径权重,并赋予相应路径,根据树结点的有效信息和树结构的有效路径计算树的匹配度。在大规模XML文档查询方面,实验验证了该算法在保证较高查准率和查全率的基础上,有效提高了查询效率。
Focusing on the deficiency of the current XML document information query algorithm,a matching algorithm based on effective path weight tree was proposed.On the basis of maintaining the XML document tree effective nodes and tree structure,the root node information was most important,and as the tree's depth increases,the importance of node information was gradually weakened.According to the path's layer,the path weights were automatically calculated and then given to the corresponding path.According to the effective information of tree nodes and the effective path of tree structure,the tree matching degree was calculated.For queries in large XML document,the experimental results demonstrate that the algorithm not only guarantees the high precision ratio and the recall ratio,but effectively improves the query efficiency.