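In recent years, XML stream management has become a hot research topic in the data stream field. Unlike a traditional database management system, an XML data stream management system must process stream data that arrives online at any time while using only a small amount of memory. Because XML streams have a recursive, hierarchical structure and user interests are usually expressed in an XML query language, XML stream management techniques differ greatly from those for relational (tuple-based) data streams. This paper surveys XML stream management techniques and the related research results; points out the characteristics of XML stream management systems; compares current typical XML data stream management systems; systematically describes and analyzes existing approaches to XML stream processing, from China and abroad (including automaton-based, index-based, and sequence-based approaches), together with the related optimization techniques for stream processing; and identifies problems that require further research.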
XML stream management systems fit a large class of new applications, such as publish/subscribe systems and network monitoring systems, and the Extensible Markup Language (XML) has become the de facto standard for representing and exchanging Web data. XML stream management has therefore recently become a hot spot in data stream research. Unlike a traditional XML database management system, an XML stream system aims to provide fast, on-the-fly matching of XML-encoded data against users' queries. This usually involves handling XML streams that arrive online at any moment and in any order, and responding in a timely manner without incurring excessive memory cost. Because XML streams are nested and recursive, and users' interests are expressed in XML query languages such as XQuery or XPath, an XML stream management system is very different from a relational (i.e., tuple-based) data stream system. This paper presents a comprehensive overview of research on XML stream management systems: the characteristics of XML stream management are pointed out; typical XML stream management systems are compared; existing approaches to processing XML streams, including automaton-based, index-based, and sequence-based approaches, are discussed and analyzed; and optimization techniques for processing XML streams are described. Directions for further research are also identified.