随着网络搜索用户的大规模增加,网络用户行为分析已成为网络信息检索系统进行架构分析、性能优化和系统维护的重要基石,是网络信息检索和知识挖掘的重要研究领域之一。为更好理解网络用户的搜索行为,该文基于7.56亿条真实网络用户行为日志,对用户行为进行分析和研究。我们主要考察了用户搜索行为中的查询长度、查询修改率、相关搜索点击率、首次/最后一次点击位置分布以及查询内点击数分布等信息。该文还基于不同类型的查询集合,考察用户在不同查询需求下的行为差异性。相关分析结果对搜索引擎算法优化和系统改进等都具有一定的参考意义。
With the growth in amount of search users, the behavior analysis has become one of the most important research issues for search engines in terms of architecture analysis, performance optimization and system maintenance. It is also a major area in both information retrieval and knowledge management. In order to better understand search behavior of web users, we analyzed web user behaviors based on 756 million entries of click-through logs. Several important aspects of user behaviors are studied, such as query length, ratio of query refining, query recommendation access, first/last click distribution, click number in query, et al. We also analyzed the differences in user behavior for different information needs based on separate query sets. These analyses may help improve both effectiveness and efficiency of search engines.