大数据的迅猛发展导致数据中心的存储规模急剧扩张,由此引发的高能耗已经成为数据中心普遍面临的一个突出问题,磁盘类存储介质在数据中心耗能中所占的比例也在逐年增加,能耗建模在目前学者们的研究中越来越受到关注。精确的磁盘能耗模型不仅可以解决数据中心中的电力配套问题,而且为当前数据中心各种能耗管理技术体现更为精确的节能效果。提出了一种基于统计量的磁盘能耗预测模型,该模型弥补了传统细粒度模型产生的额外负载影响,同时获取了比传统粗粒度模型更佳的预测准确率。在实际应用中,该模型不需要分析记录复杂的磁盘内部活动细节,也不需要繁杂的参数采集,仅需要存储系统中宏观的统计量作为参数,且预测精度与细粒度模型近似。通过实验验证,该模型在能耗预测上的平均误差为3%,并且针对同步IO及异步IO都有较好的预测效果。此外,该模型还可以应用于各种在线系统的能耗预测。
Due to the rapid development of big data in the data center ,power consumption of storage system is a major issue in today's datacenters . How to reduce the power consumption of storage systems has become an urgent issue and a hot research topic in the field of computer science .As the hard disk drive is the primary storage medium in today's storage systems ,modeling hard‐disk power consumption is attracting more attention in the current state of research . The accurate power consumption model of disk can not only solve the problem of power matching in data center devices , but also estimate the accuracy of energy‐efficient solutions . We develop a statistic‐based hard‐disk power modeling method that estimates the power consumption of storage workloads . The model makes up the weakness of traditional fine‐grained model and it is more accurate than the coarse‐grained model .In practical applications ,it does not need to record the disk internal activities ,and does not need to trace complex parameter .Our power estimation results are highly accurate ,which means error of 3% and the model is applicable to the synchronous IO and asynchronous IO .Moreover ,our model can also be applied to various online storage systems and data center .