M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference

Chuxiong Sun; Peng He; Qirui Ji; Zehua Zang; Jiangmeng Li; Rui Wang; Wei Wang

doi:10.1609/aaai.v40i30.39764

Authors

Chuxiong Sun National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences
Peng He Beijing University of Posts and Telecommunications
Qirui Ji National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
Zehua Zang National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
Jiangmeng Li National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences
Rui Wang National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences National Key Laboratory of Complex System Modeling and Simulation Technology
Wei Wang Beijing University of Posts and Telecommunications

DOI:

https://doi.org/10.1609/aaai.v40i30.39764

Abstract

Communication is essential in coordinating the behaviors of multiple agents. However, existing methods primarily emphasize content, timing, and partners for information sharing, often neglecting the critical aspect of integrating shared information. This gap can significantly impact agents' ability to understand and respond to complex, uncertain interactions, thus affecting overall communication efficiency. To address this issue, we introduce M2I2, a novel framework designed to enhance the agents' capabilities to assimilate and utilize received information effectively. M2I2 equips agents with advanced capabilities for masked state modeling and joint-action prediction, enriching their perception of environmental uncertainties and facilitating the anticipation of teammates' intentions. This approach ensures that agents are furnished with both comprehensive and relevant information, bolstering more informed and synergistic behaviors. Moreover, we propose a Dimensional Rational Network, innovatively trained via a meta-learning paradigm, to identify the importance of dimensional pieces of information, evaluating their contributions to decision-making and auxiliary tasks. Then, we implement an importance-based heuristic for selective information masking and sharing. This strategy optimizes the efficiency of masked state modeling and the rationale behind information sharing. We evaluate M2I2 across diverse multi-agent tasks, the results demonstrate its superior performance, efficiency, and generalization capabilities, over existing state-of-the-art methods in various complex scenarios.

M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information