由于使用方便等优点,数据库上的关键字检索技术使用户可以得到所需信息而不必书写复杂的SQL语句.但大部分现有的检索方法都关注通过连接操作得到包含所有关键字的元组连接树,忽略了对于检索结果的信息整合,这从某种程度上影响了用户对于检索结果的判断.文中提出并实现一种改进的关键字检索系统框架,在具有层次结构的属性指导下对得到的元组连接树结果做聚合操作,通过寻找最低层次最小覆盖聚合将关系更为紧密的元组作为更加相关的检索结果反馈给用户.文中还提出了基本的聚合算法并对其做改进从而减少了系统的响应时间.同时,为了改善用户体验,文中定义并给出了检索结果的摘要问题及其算法,使用户最大程度地了解检索结果.实验数据表明,文中的方法能够以较高的效率和较低的计算代价有效地完成检索结果的聚合和摘要.
Keyword search(KWS) has been well accepted as a proven,user-friendly way to re-trieve information,and recently applied successfully on relational databases.Today,this tech-nique allows users to find pieces of information without having to compose complicated SQL que-ries.However,almost all the existing approaches focus on finding joined tuples matching a set of key words and return the results as joining networks of tuples.In order to feed back the user more relative information,this paper formulates an expanding version of existing system to answer aggregate keyword queries over hierarchical relational databases in which the value of a specific attribute is organized in hierarchical structure.This version retrieves information in the form of MaxLMC(Max-Lowest hierarchy Minimum Coverage aggregate,which consists of tuples more similar and closer to each other) under the conduct of the above hierarchical structure.A Nave algorithm is proposed to obtain MaxLMC and its enhancement is designed to reduce the system's responding time.Meanwhile,recognized that the number of returned answer might be extremely large in practical,we defined and studied the problem of effective exploration of large sets of aggregating tuples: summarization,a technique which has been applied to help the user find diverse aggregating tuples,thus can be used to improve the user experience.An extensive empirical evaluation using both real data sets and synthetic data sets is reported to verify the effectiveness and the efficiency of our methods.