CtoD-MAT: Bridging Centralized and Decentralized Execution in Multi-Agent Reinforcement Learning (Student Abstract)

Authors

  • Shota Takayama Graduate School of Engineering, Tokyo University of Agriculture and Technology
  • Katsuhide Fujita Institute of Global Innovation Research, Tokyo University of Agriculture and Technology

DOI:

https://doi.org/10.1609/aaai.v40i48.42287

Abstract

Although centralized training with centralized execution (CTCE) excels at multi-agent coordination, its reliance on global information limits its use in the real world. Conversely, the practical decentralized execution (CTDE) paradigm often struggles with complex coordination. This paper bridges this critical gap by introducing the Centralized-to-Decentralized (CtoD) learning concept: a novel framework for transferring the knowledge of a powerful centralized policy into a robust, practical decentralized policy. Our method, CtoD-MAT, realizes this transition through a curriculum that gradually shifts agents from centralized to decentralized control. A key innovation is our dynamic scheduling mechanism, featuring a mediator module, which ensures a robust and effective knowledge transfer. Using challenging SMAC benchmarks, we demonstrate that CtoD-MAT successfully produces competitive decentralized policies, notably solving complex coordination tasks that are difficult for standard CTDE methods.

Downloads

Published

2026-03-14

How to Cite

Takayama, S., & Fujita, K. (2026). CtoD-MAT: Bridging Centralized and Decentralized Execution in Multi-Agent Reinforcement Learning (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41406–41408. https://doi.org/10.1609/aaai.v40i48.42287