With the arrival of the big data era, evaluating the timeliness (currency) of network information has become an active research topic. Taking Web news as the object of study, this paper examines Web information extraction and Chinese word segmentation techniques in a big data environment and, on that basis, proposes an algorithm for evaluating the currency of network information based on Web semantic information extraction. The experimental results demonstrate the effectiveness of the algorithm's implementation. The approach can guide network users toward more valuable Web information and help website administrators build websites with higher currency.