Uncertain data is now widespread in practical applications such as sensor networks, RFID networks, location-based services, mobile object management, online shopping, and market surveillance. As an important aspect of uncertain data management, the uncertain skyline query has received considerable attention in the database and network computing communities in recent years because of its role in decision making, market analysis, environmental monitoring, and data mining. First, this survey reviews the definitions of skyline queries on various types of uncertain data, including those based on discrete and continuous probability distribution models and on incomplete data. Second, it analyzes the characteristics of uncertain skyline queries and, on that basis, surveys existing centralized and distributed skyline query approaches for uncertain data sets, with emphasis on the principles, advantages, and disadvantages of each algorithm. Third, it introduces the definition of skyline queries over uncertain data streams and surveys the existing query methods for such streams. Finally, based on the latest research, it outlines future directions for research on uncertain skyline queries.