Many Minds, One Path: LLM-Augmented Consensus Decision for Distributed Control in Multi-Agent Collaborative Stable Scenarios

Authors

  • Zhuohao Yu State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
  • Zhe Liu State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
  • Tao Ren State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
  • Chenxue Wang State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
  • Junjie Wang State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
  • Qing Wang State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v40i33.40026

Abstract

Distributed multi-agent systems are increasingly deployed in dynamic and high-stakes environments such as power grids, intelligent traffic systems, and collaborative robotics. In these systems, long-term stability, the ability to maintain coherent and safe system behavior over time, is critical but underexplored in existing research. This paper presents LLMASC, a framework designed to enhance long-term stability in multi-agent collaboration by combining semantic reasoning with decentralized control. LLMASC comprises three key components: a Semantic Perception Encoder that transforms heterogeneous agent observations into structured natural language; an LLM-Guided Consensus Decision module that enables strategic alignment through proposal exchange and voting; and a Policy Execution Controller that maps high-level plans to executable actions via reinforcement learning. We evaluate LLMASC across three representative simulation domains (Multi-Walker, Simulation of Urban Mobility and Power Grid Stabilization), spanning both physical and cyber-physical systems. Experiments show that LLMASC consistently outperforms the best baselines, improving stability rates by up to 44% and long-term success by 31%. Further analysis confirms its decision-making efficiency and robustness under varying agent populations and model choices.

Downloads

Published

2026-03-14

How to Cite

Yu, Z., Liu, Z., Ren, T., Wang, C., Wang, J., & Wang, Q. (2026). Many Minds, One Path: LLM-Augmented Consensus Decision for Distributed Control in Multi-Agent Collaborative Stable Scenarios. Proceedings of the AAAI Conference on Artificial Intelligence, 40(33), 28014-28022. https://doi.org/10.1609/aaai.v40i33.40026

Issue

Section

AAAI Technical Track on Machine Learning X