Agent Learning Using Action-Dependent Learning Rates in Computer Role-Playing Games

Authors

  • Maria Cutumisu University of Alberta
  • Duane Szafron University of Alberta
  • Michael Bowling University of Alberta
  • Richard S. Sutton University of Alberta

DOI:

https://doi.org/10.1609/aiide.v4i1.18667

Abstract

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditional reinforcement learning techniques. Our learning rates are action-dependent and increase or decrease based on trends in reward sequences. Our exploration rate decreases when the agent is learning successfully and increases otherwise. These improvements result in faster learning. We implemented this algorithm in NWScript, a scripting language used by BioWare Corp.’s Neverwinter Nights game, with the goal of improving the behaviours of game agents so that they react more intelligently to game events. Our goal is to provide an agent with the ability to (1) discover favourable policies in a multiagent computer role-playing game situation and (2) adapt to sudden changes in the environment.

Downloads

Published

2021-09-27

How to Cite

Cutumisu, M., Szafron, D., Bowling, M., & Sutton, R. (2021). Agent Learning Using Action-Dependent Learning Rates in Computer Role-Playing Games. Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 4(1), 22-29. https://doi.org/10.1609/aiide.v4i1.18667