与普通XML文档相比,概率XML数据中节点的类型不唯一且节点的出现具有相应的概率。提出一种高效的编码策略DeweyTP,该编码策略为每个XML数据节点分配唯一的能够体现节点类型和路径概率的编码,来支持节点类型检测和路径概率提取,因而提升系统性能。最后通过实验从时间和空间两方面验证了DeweyTP编码的高效性。
Compared with ordinary XML documents, nodes in the probabilistic XML documents have two characteristics, the type of nodes was non-unique and the nodes exist with a corresponding probability. As an efficient labeling scheme, DeweyTP was proposed to assign each node a uniqtte label, which contains the type and path probability of nodes, supporting the detection of node type and the extraction of path probability, and thus improves the system performance. Finally, experimentally evaluated DeweyTP encoding scheme were experimentally evaluated in aspects of time and space efficiency.