Many Minds, One Path: LLM-Augmented Consensus Decision for Distributed Control in Multi-Agent Collaborative Stable Scenarios

Zhuohao Yu; Zhe Liu; Tao Ren; Chenxue Wang; Junjie Wang; Qing Wang

doi:10.1609/aaai.v40i33.40026

Authors

Zhuohao Yu State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Zhe Liu State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Tao Ren State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Chenxue Wang State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Junjie Wang State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China
Qing Wang State Key Laboratory of Complex System Modeling and Simulation Technology, Beijing, China Institute of Software Chinese Academy of Sciences, Beijing, China University of Chinese Academy of Sciences, Beijing, China

DOI:

https://doi.org/10.1609/aaai.v40i33.40026

Abstract

Distributed multi-agent systems are increasingly deployed in dynamic and high-stakes environments such as power grids, intelligent traffic systems, and collaborative robotics. In these systems, long-term stability, the ability to maintain coherent and safe system behavior over time, is critical but underexplored in existing research. This paper presents LLMASC, a framework designed to enhance long-term stability in multi-agent collaboration by combining semantic reasoning with decentralized control. LLMASC comprises three key components: a Semantic Perception Encoder that transforms heterogeneous agent observations into structured natural language; an LLM-Guided Consensus Decision module that enables strategic alignment through proposal exchange and voting; and a Policy Execution Controller that maps high-level plans to executable actions via reinforcement learning. We evaluate LLMASC across three representative simulation domains (Multi-Walker, Simulation of Urban Mobility and Power Grid Stabilization), spanning both physical and cyber-physical systems. Experiments show that LLMASC consistently outperforms the best baselines, improving stability rates by up to 44% and long-term success by 31%. Further analysis confirms its decision-making efficiency and robustness under varying agent populations and model choices.

Many Minds, One Path: LLM-Augmented Consensus Decision for Distributed Control in Multi-Agent Collaborative Stable Scenarios

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information