随着信息检索、数据挖掘、自然语言处理和机器学习等多领域的理论和技术的发展,搜索引擎技术得到了迅猛的发展和广泛的应用。本文旨在对搜索引擎的发展阶段进行分析,给出搜索引擎技术的发展概貌。基本思想是,一方面利用文本的有序聚类方法对搜索引擎的发展过程进行有序划分,并在此基础上对各个发展阶段的主要特点进行分析;另一方面通过基于词频的统计分析,深入挖掘搜索引擎技术和信息检索技术方面的研究热点,并对其进行分析和总结。
With the development of Information Retrieval, Data Mining, and Natural Language Processing, the technologies of Search Engine have been apphed and developed widely. In this study, text ordered cluster method is adopted to segment the whole development process of Search Engine, and the characteristics of the various stages are analyzed. Meanwhile, great efforts are made to discover the research hot spot. Based on the clustering results, the technology of search engines have experienced three stages, and different stage has different research hot spot. It is very useful to study and make use of various search engines.