提出了一种基于改进学习分类器的多机器人强化学习方法。增强学习使机器人能发现一组用于指导其强化学习行为的规则。遗传算法则在现有的规则中淘汰掉较差的,并利用较优的种群规则产生出新的学习规则。规则合并能提高多机器人的并行强化学习效率,使多个机器人自主地学习到相互协作的最优策略。算法的分析和仿真表明,将改进的学习分类器用于多机器人的强化学习是有效的。
This paper proposes a multi-robots reinforcement learning method based on improved learning classifier system.The enhanced learning enables robots to discover a group rules for guiding their reinforcement leaning behavior.Genetic algorithm could eliminate worse ones in the existing rules and produce new learning rules with the superior population rules.The merged rules can increase multi-robots' learning efficiency in parallel,thus the multi-robots could learn to collaborate with the best strategy.The algorithm analysis and the simulation indicate that the improved learning classifier system used in the multi-robot reinforcement learning is feasible and effective.