提出一种基于连续时间Markov决策过程的动态电源管理策略优化方法.通过建立动态电源管理系统的随机切换模型,将动态电源管理问题转化为带约束的策略优化问题,并给出一种基于矢量合成的策略梯度优化算法.随机切换模型对动态电源管理系统的描述精确,策略优化算法简便有效,既能离线计算,也适用于在线优化.仿真实验验证了该方法的有效性.
Based on continuous-time Markov decision processes, a power management policy optimization approach is proposed. First an event-driven stochastic switching model is introduced for power-managed systems. Under this model, the problem of dynamic power management is formulated as a constrained policy optimization problem. Then an efficient synthetic gradient-based policy optimization algorithm is presented. This algorithm can be employed to on-line optimization as well as off-line numerical computation.Finally, a simulation example is given to illustrate the effectiveness of the proposed approach.