Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract)

Feng Xu; Shengyi Jiang; Hao Yin; Zongzhang Zhang; Yang Yu; Ming Li; Dong Li; Wulong Liu

doi:10.1609/aaai.v35i18.17965

Authors

Feng Xu National Key Laboratory for Novel Software Technology, Nanjing University
Shengyi Jiang National Key Laboratory for Novel Software Technology, Nanjing University
Hao Yin National Key Laboratory for Novel Software Technology, Nanjing University
Zongzhang Zhang National Key Laboratory for Novel Software Technology, Nanjing University
Yang Yu National Key Laboratory for Novel Software Technology, Nanjing University
Ming Li National Key Laboratory for Novel Software Technology, Nanjing University
Dong Li Noah’s Ark Lab, Huawei Company
Wulong Liu Noah’s Ark Lab, Huawei Company

DOI:

https://doi.org/10.1609/aaai.v35i18.17965

Keywords:

Meta Learning, Reinforcement Learning, Representation Learning

Abstract

Meta-Reinforcement Learning (meta-RL) algorithms enable agents to adapt to new tasks from small amounts of exploration, based on the experience of similar tasks. Recent studies have pointed out that a good representation of a task is key to the success of off-policy context-based meta-RL. Inspired by contrastive methods in unsupervised representation learning, we propose a new method to learn the task representation based on the mutual information between transition tuples in a trajectory and the task embedding. We also propose a new estimation for task similarity based on Q-function, which can be used to form a constraint on the distribution of the encoded task variables, making the task encoder encode the task variables more effective on new tasks. Experiments on meta-RL tasks show that the newly proposed method outperforms existing meta-RL algorithms.

Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract)

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription