随着电费数据量的快速增长,某特大型集团公司财务管理信息系统传统的电费数据处理模式已经成为系统的性能瓶颈.Hadoop是一个可实现大规模分布式计算的开源框架,具有高效、可靠、可伸缩的优点,被广泛应用于海量数据处理领域.本文在对电费业务和Hadoop进行分析和研究的基础上,提出了电费数据新的处理模型,建立了基于Hadoop和Hive的电费明细数据处理平台.实验证明该模型可以有效解决目前海量电费数据处理面临的性能瓶颈,提高电费数据处理的速度和效率,并且可以提供高性能的明细数据查询功能.
The traditional electricity data processing methods of a corporation's financial management m~ormauon system have difficulty as the amount of electricity data is growing rapidly. Hadoop is a large-scale distributed computing framework that has the advantages of high efficient, reliable and scalable. Hadoop is widely used in the massive data processing field. Based on the analysis and research of electricity process and Hadoop, this paper proposed a novel electricity process model which includes a distributed computing platform based on Hadoop and Hive. The experimental results show the platform can effectively solve the performance bottleneck that the electricity processing service is facing and improve the speed and efficiency of electricity process. In addition, the new model can provide high-performance electricity detailed query functions.