Emergent Fast-Slow Dynamics in Multi-Agent Q-Learning for Networked Stochastic Games

Authors

  • Yuxin Geng School of Mathematical Sciences, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Wolfram Barfuss Transdisciplinary Research Area Sustainable Futures, University of Bonn, Germany Center for Development Research, University of Bonn, Germany Institute for Food and Resource Economics, University of Bonn, Germany
  • Xingru Chen School of Artificial Intelligence, Beihang University, Beijing 100191, China

DOI:

https://doi.org/10.1609/aaai.v40i35.40186

Abstract

Understanding the emergence of collective behaviors of multi-agent systems requires investigating the learning dynamics. However, the theoretical analysis of large-scale graph-structured multi-agent reinforcement learning (MARL) systems remains challenging due to agent heterogeneity and the intrinsic coupling between state transitions and individual Q-value updates. In this work, we develop a unified theoretical framework that captures the evolution of agent behaviors at both individual and population levels. By leveraging the pair approximation technique from statistical physics, we derive a closed set of evolution equations that accurately describe the temporal dynamics of the system. Our analysis also reveals a separation of time scales. For small learning rates, state transitions equilibrate rapidly, while Q-value updates evolve slowly with stationary state distributions. Through extensive agent-based simulations, we validate the robustness of our theoretical results and explain the mechanisms that lead to the emergence of cooperation in social dilemmas. Our framework offers new perspectives for bridging complex systems science and MARL, providing insights for the design of cooperative and resilient AI.

Downloads

Published

2026-03-14

How to Cite

Geng, Y., Barfuss, W., & Chen, X. (2026). Emergent Fast-Slow Dynamics in Multi-Agent Q-Learning for Networked Stochastic Games. Proceedings of the AAAI Conference on Artificial Intelligence, 40(35), 29450-29458. https://doi.org/10.1609/aaai.v40i35.40186

Issue

Section

AAAI Technical Track on Multiagent Systems