针对某些恶意页面利用搜索引擎的局限性隐藏在搜索结果排名较靠前的位置这一问题,本文提出了基于Hits算法的Web安全改进模型.该模型在Hits算法的基础上,结合向量空间模型来评价网页的风险程度,通过对恶意页面的Authority值进行"惩罚"来降低恶意页面在搜索结果中的排序,从而减低恶意页面被访问到的概率.实验结果表明,恶意网页的Authority值明显降低,而非恶意网页的Authority值有所上升,这使得用户通过搜索引擎点击到恶意页面的概率大幅降低.
In search engines, some malicious pages are hidden in search results athigh rank position. In this paper, an improved Hits algorithm-based web security model is proposed. The vector space model is used to evaluate the risk of the web pages. The malicious pages are punished to reduce the rank position in the search results. The probability of the malicious pages to be accessed is reduced. The experimental results show that the Authority value of a malicious web page is reduced and Authority value of the non-malicious web is increased in this model. Therefore, the probabili- ty that the malicious web pages are clicked is reduced.