Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables

Authors

  • Yang Chen NAOInstitute, University of Auckland, New Zealand School of Computer Science, University of Auckland, New Zealand
  • Xiao Lin School of Computer Science, Beijing Institute of Technology, Beijing, China
  • Bo Yan School of Computer Science, Beijing Institute of Technology, Beijing, China
  • Libo Zhang School of Computer Science, University of Auckland, New Zealand
  • Jiamou Liu School of Computer Science, University of Auckland, New Zealand
  • Neset Özkan Tan NAOInstitute, University of Auckland, New Zealand School of Computer Science, University of Auckland, New Zealand
  • Michael Witbrock NAOInstitute, University of Auckland, New Zealand School of Computer Science, University of Auckland, New Zealand

DOI:

https://doi.org/10.1609/aaai.v38i10.29021

Keywords:

ML: Imitation Learning & Inverse Reinforcement Learning, ML: Reinforcement Learning, MAS: Multiagent Learning

Abstract

Designing suitable reward functions for numerous interacting intelligent agents is challenging in real-world applications. Inverse reinforcement learning (IRL) in mean field games (MFGs) offers a practical framework to infer reward functions from expert demonstrations. While promising, the assumption of agent homogeneity limits the capability of existing methods to handle demonstrations with heterogeneous and unknown objectives, which are common in practice. To this end, we propose a deep latent variable MFG model and an associated IRL method. Critically, our method can infer rewards from different yet structurally similar tasks without prior knowledge about underlying contexts or modifying the MFG model itself. Our experiments, conducted on simulated scenarios and a real-world spatial taxi-ride pricing problem, demonstrate the superiority of our approach over state-of-the-art IRL methods in MFGs.

Published

2024-03-24

How to Cite

Chen, Y., Lin, X., Yan, B., Zhang, L., Liu, J., Özkan Tan, N., & Witbrock, M. (2024). Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables. Proceedings of the AAAI Conference on Artificial Intelligence, 38(10), 11407-11415. https://doi.org/10.1609/aaai.v38i10.29021

Issue

Section

AAAI Technical Track on Machine Learning I