Global Policy Construction in Modular Reinforcement Learning
DOI:
https://doi.org/10.1609/aaai.v29i1.9736Keywords:
Modular reinforcemenet learningAbstract
We propose a modular reinforcement learning algorithm which decomposes a Markov decision process into independent modules. Each module is trained using Sarsa(lambda). We introduce three algorithms for forming global policy from modules policies, and demonstrate our results using a 2D grid world.
Downloads
Published
2015-03-04
How to Cite
Zhang, R., Song, Z., & Ballard, D. (2015). Global Policy Construction in Modular Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9736
Issue
Section
Student Abstract Track