Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning

Authors

  • Nhat Nguyen Adelaide University Università di Foggia
  • Duong Nguyen Adelaide University
  • Gianluca Rizzo HES-SO Valais Università di Torino
  • Hung Nguyen Adelaide University

DOI:

https://doi.org/10.1609/icaps.v36i1.42854

Abstract

Decentralized Monte Carlo Tree Search (Dec-MCTS) is widely used for cooperative multi-agent planning but struggles in sparse or skewed reward environments. We introduce Coordinated Boltzmann MCTS (CB-MCTS), which replaces deterministic UCT with a stochastic Boltzmann policy and a decaying entropy bonus for sustained yet focused exploration. While Boltzmann exploration has been studied in single-agent MCTS, applying it in multi-agent systems poses unique challenges. CB-MCTS is the first to address this. We analyze CB‑MCTS in the simple-regret setting and show in simulations that it outperforms Dec-MCTS in deceptive scenarios and remains competitive on standard benchmarks, providing a robust solution for multi-agent planning.

Downloads

Published

2026-06-08

How to Cite

Nguyen, N., Nguyen, D., Rizzo, G., & Nguyen, H. (2026). Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning. Proceedings of the International Conference on Automated Planning and Scheduling, 36(1), 404–408. https://doi.org/10.1609/icaps.v36i1.42854