
Solving for Robust Control Policies of Multi-chain MDPs

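ISSN: 1004-731X
Journal: Journal of System Simulation (《系统仿真学报》)
Date: 0
Classification: TP202 [Automation and Computer Technology: Control Science and Engineering; Automation and Computer Technology: Detection Technology and Automatic Equipment]
Author affiliation: [1] School of Computer and Information, Hefei University of Technology, Hefei 230009, China
Funding: National Natural Science Foundation of China (60404009); Natural Science Foundation of Anhui Province (070416242); Key Natural Science Research Project of Anhui Provincial Universities (KJ2007A063)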

Keywords: multi-chain MDP, performance potential, robust control, parallel genetic algorithm

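Chinese abstract (translated):

Many optimization algorithms for Markov decision processes (MDPs) depend on the system's transition rates, yet uncertainty in the system parameters often makes those rates hard to determine precisely. For a class of uncertain multi-chain MDP models, the robust control problem is studied on the basis of performance potentials in two cases, uncorrelated parameters and correlated parameters, and a policy iteration algorithm and a parallel genetic algorithm are given, respectively, for computing the system's optimal robust performance. Finally, a numerical example is used to analyze the effectiveness of the algorithms.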

English abstract:

Optimization techniques for Markov decision processes (MDPs) usually depend on the transition rates of the underlying stochastic processes, whose exact values are hard to obtain because of uncertainty in the system parameters. The robust control of a class of uncertain multi-chain MDPs is discussed for both independent and dependent parameters, using performance potentials. A policy iteration algorithm and a parallel genetic algorithm are provided, respectively, to derive the system's robust optimal performance. Finally, a numerical example illustrates the effectiveness of these algorithms.
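
This record does not reproduce the paper's algorithms, so the following is only a minimal sketch of the quantity the abstract builds on, the performance potential, under simplifying assumptions that are not taken from the paper: a single ergodic continuous-time chain (not the multi-chain case), a fixed policy, and exactly known rates. The function names `solve`, `stationary`, and `potential` are hypothetical. For a generator Q, stationary distribution pi, and reward vector f, the average reward is eta = pi f, and the potential g is taken here as the solution of (Q - e pi) g = -f, which also satisfies Q g = -f + eta e.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    # Build the augmented matrix [a | b] as floats.
    m = [[float(v) for v in row] + [float(b[i])] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[piv][col]) < 1e-12:
            raise ValueError("singular system")
        m[col], m[piv] = m[piv], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def stationary(q):
    """Stationary distribution pi of an ergodic generator q: pi q = 0, sum(pi) = 1."""
    n = len(q)
    a = [[q[j][i] for j in range(n)] for i in range(n)]  # transpose of q
    a[-1] = [1.0] * n  # replace one balance equation by the normalization
    b = [0.0] * (n - 1) + [1.0]
    return solve(a, b)


def potential(q, f):
    """Return (pi, eta, g) where g solves (q - e pi) g = -f (assumed ergodic chain)."""
    n = len(q)
    pi = stationary(q)
    eta = sum(p * r for p, r in zip(pi, f))
    a = [[q[i][j] - pi[j] for j in range(n)] for i in range(n)]
    g = solve(a, [-r for r in f])
    return pi, eta, g


# Illustrative two-state chain; the rates and rewards are made up for this sketch.
Q = [[-1.0, 1.0], [2.0, -2.0]]
f = [3.0, 0.0]
pi, eta, g = potential(Q, f)
# pi is approximately (2/3, 1/3), eta is 2, and g is approximately (7/3, 4/3).
```

In a robust setting one would compare such average rewards across the admissible rate values; how the paper does this for multi-chain models is not stated in this record.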
