用户兴趣模型是电子商务个性化推荐服务的基础,用户兴趣数据的获取则是构建用户兴趣模型的核心环节。传统的轮询采集方法存在数据源不够全面、在线应用可扩展性差的不足,会导致同一业务分析所需数据采集时间跨度大、先采集数据可能失效等情况,使得最终业务分析结果出现偏差。针对上述问题,对B2C电子商务用户兴趣数据进行了深入分析,提出了一种基于智能Agent的多源用户兴趣数据采集机制DFM-IA(Data Fetching Mechanism based on Intelligent Agent)。DFM-IA以用户Session为基本处理单元,设计了四种智能Agent(Fetching Agent、Watching Agent、Sort Agent、Logical Agent)和三条排序规则,对七类用户兴趣数据(浏览行为、关键词搜索、收藏行为、购物车行为、订单行为、支付行为、评价行为)进行排序与合并处理,从而在丰富数据采集源的同时大幅提高了在线数据采集效率,有助于解决推荐服务的数据稀疏性问题。仿真实验表明了该机制的高效性。
How to realize accurate, fast and personalized recommendation service is an important research topic of social computing. Recommender systems can help users improve the quality and efficiency of decision-making, and is considered to be the most effective tool to solve the current problem of information overload. The essence of recommender systems is a user-oriented decision support system, and can be used to help E-commerce enterprise managers run target selection. Thus, recommender systems have been regarded as important strategic considerations for enterprises. User interest model is the fundamental of E-commerce recommender systems. Data collection of user interest is the core work for building user interest model. The traditional polling method has some disadvantages, such as incomprehensive data sources and poor scalability for online applications. These disadvantages will lead to some bad situations. For instance, data collection for analysis of same business may span a big time interval, or prior data may be invalidated. All these bad situations could eventually result in skewed analysis. Hence, B2C websites need excellent data collection methods to provide strong support for personalized recommendation service, which is one type of big data analysis. In Section 1 we introduced and analyzed the related work. These interest data, which are created during the course of user-website interaction, can be divided into two types: explicit ratings and implicit ratings. We analyzed the advantages and disadvantages of the two types of data in detail. In Section 2 we deeply analyzed seven types of user interest data in B2C E-commerce. By the analysis of user-system interaction behaviors in some big B2C E-commerce websites at home and abroad, we drew a graph to describe the interest expression levels of user behaviors. After that, we summarized and analyzed the seven types of user interest data sources, including browsing behavior, keyword search, collect behavior, shopping cart behavior, order behavior, payment b