Blog信息源和信息量迅速增长,并已通过频繁的链接和信息交互在互联网上构建了一个动态且紧密的社会网络,成为现实世界一个重要的信息来源。目前,Blog领域的研究主要集中在Blog的定义与识别、内容挖掘、社区发现、重要性分析、Blog搜索和作弊Blog识别等几个方面。大部分研究采用或借鉴了链接分析、自然语言处理等方面的技术和方法,也提出了一些针对Blog领域的特定方法.分析和比较了Blog领域的相关研究,并且讨论了研究中存在的问题展望了未来的研究方向.
Popularity of bloggers and the amount of information in the blogosphere increase fast. Blogs have constituted a dynamic and tightly social network by using frequent links and information interaction, and become an important source of information for the real world. Most researches on blog mainly concentrate on blog definition and identification, content mining, community discovery, importance analysis, blog search and spare blog identification. Methods and technologies of link analysis and natural language processing are used in most works, and some blog-specific methods are proposed. This paper analyzes and compares these researches on blogosphere. Problems of current topics are discussed, and finally future directions are proposed in this paper.