Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces
DOI:
https://doi.org/10.1609/aaai.v38i11.29121
Keywords:
ML: Reinforcement Learning, MAS: Coordination and Collaboration, SO: Sampling/Simulation-based Search
Abstract
AlphaZero and MuZero have achieved state-of-the-art (SOTA) performance in a wide range of domains with discrete and continuous action spaces, including board games and robotics. However, to obtain an improved policy, they often require an excessively large number of simulations, especially in domains with large action spaces, and their performance drops significantly as the simulation budget decreases. In addition, many important real-world applications have combinatorial (or exponential) action spaces, making it infeasible to search directly over all possible actions. In this paper, we extend AlphaZero and MuZero to learn and plan in more complex multiagent (MA) Markov decision processes, where the action space grows exponentially with the number of agents. Our new algorithms, MA Gumbel AlphaZero (without model learning) and MA Gumbel MuZero (with model learning), achieve superior performance on cooperative multiagent control problems while reducing the number of environmental interactions by up to an order of magnitude compared to model-free approaches. In particular, we significantly improve on prior performance when planning with much smaller simulation budgets. The code and appendix are available at https://github.com/tjuHaoXiaotian/MA-MuZero.
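For intuition, below is a minimal NumPy sketch of the Gumbel-Top-k trick that the Gumbel variants of AlphaZero/MuZero build on: adding i.i.d. Gumbel(0, 1) noise to the policy logits and keeping the k largest perturbed values samples k distinct actions without replacement, so search can spend its simulation budget on a small candidate set instead of enumerating the full (combinatorial) action space. The function name and toy sizes here are illustrative assumptions, not code from the paper.

import numpy as np

def gumbel_top_k(logits, k, rng):
    # Gumbel-Top-k trick: perturbing the logits with i.i.d. Gumbel(0, 1)
    # noise and taking the k largest perturbed values is equivalent to
    # sampling k distinct actions without replacement from softmax(logits).
    gumbels = rng.gumbel(size=logits.shape)
    return np.argsort(logits + gumbels)[::-1][:k]

# Toy usage: shortlist 4 of 10 actions as root candidates for search.
rng = np.random.default_rng(0)
logits = rng.normal(size=10)
print(gumbel_top_k(logits, k=4, rng=rng))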
Published
2024-03-24
How to Cite
Hao, X., Hao, J., Xiao, C., Li, K., Li, D., & Zheng, Y. (2024). Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces. Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12304-12312. https://doi.org/10.1609/aaai.v38i11.29121
Issue
Vol. 38 No. 11 (2024)
Section
AAAI Technical Track on Machine Learning II