Web已经成为人们获取信息的重要来源,但Web上的信息并不都是真实可信的.因此,如何帮助用户快速判断Web上大量信息的可信性成为一个亟待解决的问题.文中提出一种基于内容信任的方法用以验证Web信息的可信程度.采用条件随机场模型进行Web信息的主题提取,利用提取的主题在Web上搜集候选证据,并利用时效性、主题相关度等特征验证候选证据的可靠性,最后进行可信度计算.实验结果表明提出的方法对评价Web信息的内容可信度是有效可行的.
Web has become an important information source for most people.However,the information on the web is not always true and credible.How to help people quickly judge the credibility of information on the web was required.A new method based on content trust was proposed in this paper,which aim to evaluate the credibility of web information.Firstly,the method utilized the conditional random field model to get the theme of web information content,which can be used to a clue to collect the candidate evidence on the web.Secondly,the truthfulness of evidence can be evaluated by the characteristic of the concurrency and relevancy of the evidence and web information theme and other characteristic.Finally,the credibility value of web information was calculated.The experiment results show the method proposed in the credibility evaluation of web information content is reasonable and effective.