Sequential Order Adjustment of Action Decisions for Multi-Agent Transformer (Student Abstract)

Authors

  • Shota Takayama Graduate School of Engineering, Tokyo University of Agriculture and Technology
  • Katsuhide Fujita Institute of Global Innovation Research, Tokyo University of Agriculture and Technology

DOI:

https://doi.org/10.1609/aaai.v39i28.35306

Abstract

Multi-agent reinforcement learning (MARL) trains multiple agents in shared environments. Recently, MARL models have significantly improved performance by leveraging sequential decision-making processes. Although these models can enhance performance, they do not explicitly con-sider the importance of the order in which agents make decisions. We propose AOAD-MAT, a novel model incorporating action decision sequence into learning. AOAD-MAT uses a Transformer-based actor-critic architecture to dynamically adjust agent action order. It introduces a subtask predicting the next agent to act, integrated into a PPO-based loss function. Experiments on StarCraft Multi-Agent Challenge and Multi-Agent MuJoCo benchmarks show AOAD-MAT out-performs existing models, demonstrating the effectiveness of adjusting agent order in MARL.

Downloads

Published

2025-04-11

How to Cite

Takayama, S., & Fujita, K. (2025). Sequential Order Adjustment of Action Decisions for Multi-Agent Transformer (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 39(28), 29509-29511. https://doi.org/10.1609/aaai.v39i28.35306