XML has become the de facto standard for data representation and exchange on the Web and is used in a growing range of applications. In recent years, element-level XML retrieval has attracted increasing attention from information retrieval researchers, and improving its effectiveness has become an important research problem. This paper implements a new feedback algorithm for element-level XML retrieval in the LEMUR system, which substantially improves the precision of retrieval results. Long-term experiments were conducted using the XML document collection and evaluation system provided by INEX. The experimental results show that the algorithm improves the system's average precision by 15.70% when content alone is used as feedback information, and by 18.19% when both content and structure are used.