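Because general-purpose search engines are comprehensive rather than domain-specific, they fall short in accuracy and speed. Targeting the new domain of blogs, this paper proposes a blog-oriented Web crawler algorithm that facilitates blog corpus collection and related blog research.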
General-purpose crawlers help people find information on the Web. However, because they are general rather than specialized, they have drawbacks in precision and efficiency. Blogs, an emerging Internet phenomenon, have attracted growing attention. The authors propose a new algorithm for a blog-oriented Web crawler that treats "Blog" as a special "subject".