Internet上的不良信息日益增多成为危害严重的社会问题,对Internet进行监控成为一项迫切任务.而网络爬虫在信息搜索中起着明显的作用.为此,对链接价值的内容评价机制进行了研究,分析了影响链接价值的具体因素,并据此进行链接价值的计算以指导爬虫的搜索.实验结果表明,该方法有助于优先发现目标页面.
Increase of bad information in Internet is a serious social problem, and it is an emergent task to monitor the Interact. The web crawler is important in information search. Therefore, the value estimate based on content was studied, then the factors which affect the value of link was discussed. Calculating of the value depends on the factors. The values of links redound to conducting the crawler's search. Experimental results show that this approach can find the target pages betimes.