随着Web2.0技术的快速发展,社交网络、物联网、移动互联网等新兴服务行业日益涌现,Web数据呈爆炸式增长,成为炙手可热的“大数据”。Web大数据巨大的价值使得越来越多的人开始关注,如何获取Web数据并进行挖掘利用。在大数据的环境下,Web数据呈现出规模大、种类多、数据流高速性等特点,使得Web数据抽取与集成,数据分析,数据解释等方面的研究更加深入,与此同时,Web大数据的集成与挖掘仍存在着数据规模、数据多样性、数据时效性、隐私保护等方面的挑战。
With the rapid development of technologies about Web 2.0, new service such as social network, internet of things, mobile networks increasingly come to the fore. Web data explosively growth and become the hot big data. Because of the tremendous value of big data, more and more people begin to pay attention to obtain and mine it. Discusses the concept of big data, takes this as a springboard, analyzes the extraction and integration of Web data, data analysis, data interpretation. And summarizes some new challenges in the future.