Web用户访问模式挖掘技术可以从服务器、浏览器端的日志记录中自动发现用户的访问偏好、兴趣和趋势等信息,目前已经成为Web挖掘领域的一个研究热点。文章首先给出Web访问模式挖掘系统的一般框架模型,然后介绍了框架模型中主要组成部分的工作原理,在此基础上,对Web访问模式挖掘系统中的一些关键技术的最新研究进展状况作了阐述和分析,其中包括数据采集、数据预处理、模式发现、用户可视化界面等,最后分析了未来该领域的研究重点作了展望。
Web user access patterns mining technology, currently a research hotspot in the field of web mining, is able to discover automatically such knowledge as the user access preference, interests and trends from the log records of the web server and browser sides. This paper starts with an introduction to the general architecture model of web user access patterns mining system, which is followed by a description on the working principle of the main components of this model. After that the paper elaborates and analyzes recent progress in some key technologies. Among them are data acquisition, data preprocessing, pattern discovery and visual user interface. The paper concludes with some prospects for the future research priorities in this field.