现有的查询结果多样化研究很难准确得到用户多样性需求并提供与用户查询各个方面需求相关的文档。针对这个问题,本文基于HITS算法的网页间链接分析特性,根据网页链接图直接计算查询结果列表中的文档可能满足用户多样性需求的程度,并将其应用到结果列表的重排序中以实现搜索结果多样性。在TREC大规模数据集合上的实验结果表明了该方法的有效性。
To avoid the problem that users′ diversity needs cannot be precisely obtained or documents provided cannot concern all aspects of the needs in a specific query,a new method was proposed based on the link-parsing feature of the HITS algorithm,in where the possibility was directly calculated according to the diversity of documents in the search result list for a query,and then the result list was re-ranked based on this value.Experimental results on the TREC′s large-scale data collections verified that this method was effective.