大数据时代,各种信息数据日益迅猛增长,Hadoop为海量数据的处理提供了良好的解决方案.针对云计算环境中的海量数据存储问题,介绍云存储技术的概念和体系结构,分析Hadoop两大核心机制HDFS和MapReduce,利用Hadoop成功搭建分布式计算平台,并将其应用到海量社交网络数据的存储.实践证明,系统运行良好,为未来进行社交网络大数据分析提供平台和数据保证.
All kinds of information and data increasingly have rapid growth in the age of big data,Hadoop provides a good solution for the treatment of massive datas. For the problem of massive data storage in the cloud computing environment,this paper introduces the concept and architecture of cloud storage technology, analyzes two coremechanisms of Hadoop which include HDFS and MapReduce,successfully builds a distributed computing platform with Hadoop,and applies to massive social network data storage. The practice proves that the system runs well,which provides platform and data guarantee to analyse social network big data in future.