Attentive Experience Replay

Peiquan Sun; Wengang Zhou; Houqiang Li

doi:10.1609/aaai.v34i04.6049

Authors

Peiquan Sun University of Science and Technology of China
Wengang Zhou University of Science and Technology of China
Houqiang Li University of Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v34i04.6049

Abstract

Experience replay (ER) has become an important component of deep reinforcement learning (RL) algorithms. ER enables RL algorithms to reuse past experiences for the update of current policy. By reusing a previous state for training, the RL agent would learn more accurate value estimation and better decision on that state. However, as the policy is continually updated, some states in past experiences become rarely visited, and optimization over these states might not improve the overall performance of current policy. To tackle this issue, we propose a new replay strategy to prioritize the transitions that contain states frequently visited by current policy. We introduce Attentive Experience Replay (AER), a novel experience replay algorithm that samples transitions according to the similarities between their states and the agent's state. We couple AER with different off-policy algorithms and demonstrate that AER makes consistent improvements on the suite of OpenAI gym tasks.

Attentive Experience Replay

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription