东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于强化学习的互联电网CPS自校正控制

期刊名称：电力系统保护与控制, 37(10), pp 33-38, 2009/5/16.期刊文章
时间：0
分类：TM71[电气工程—电力系统及自动化] TP181[自动化与计算机技术—控制科学与工程;自动化与计算机技术—控制理论与控制工程]
作者机构：[1]华南理工大学电力学院,广东广州510640
相关基金：国家自然科学基金项目（50807016）;广东省自然科学基金博士启动基金项目（06300091）
相关项目：CPS标准下AGC的最优松驰控制及其马尔可夫决策过程

关键词：强化学习, Q学习算法, 自动发电控制, CPS标准, 自校正控制, reinforcement learning, Q-learning algorithm, automatic generation control, CPS, self-tuning control

中文摘要：

AGC是一个动态多级决策问题一一马尔可夫决策过程（MDP），应用强化学习算法可有效地实现控制策略的在线学习和动态优化决策。引入Q学习算法作为强化学习核心算法，将CPS值看作包含AGC的电力系统“环境”所给的“奖励”，依靠奖励值Q函数与CPS控制动作形成的闭环控制结构实现在线学习。学习目标是使CPS控制动作从环境获得的长期积累奖励值最大，从而快速自动地在线优化CPS控制系统的输出。仿真研究显示，引入强化学习自校正控制后显著增强了整个AGc系统的鲁棒性和适应性，有效提高了CPS考核合格率。

英文摘要：

The automatic generation control （AGC） problem is a stochastic multistage decision problem, which can be modeled as a Markovian Decision Process （MDP）. The paper introduces the Q-learning method as the core algorithm of reinforcement learning （RL）, and regards the CPS values as the rewards from the interconnected power systems. By regulating a closed-loop CPS control rule to maximize the total reward in the procedure of on-line learning, the optimal CPS control strategy can be gradually obtained. The case study shows that after adding the RL control, the robustness and adaptability of AGC system is enhanced obviously and the CPS compliance is ensured. This work is supported by National Natural Science Foundation of China（No.50807016） and Natural Science Funds of Guangdong Province （No. 06300091）.

同期刊论文项目