M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference

Authors

  • Chuxiong Sun National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences
  • Peng He Beijing University of Posts and Telecommunications
  • Qirui Ji National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Zehua Zang National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences University of Chinese Academy of Sciences
  • Jiangmeng Li National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences
  • Rui Wang National Key Laboratory of Space Integrated Information System, Institute of Software, Chinese Academy of Sciences National Key Laboratory of Complex System Modeling and Simulation Technology
  • Wei Wang Beijing University of Posts and Telecommunications

DOI:

https://doi.org/10.1609/aaai.v40i30.39764

Abstract

Communication is essential in coordinating the behaviors of multiple agents. However, existing methods primarily emphasize content, timing, and partners for information sharing, often neglecting the critical aspect of integrating shared information. This gap can significantly impact agents' ability to understand and respond to complex, uncertain interactions, thus affecting overall communication efficiency. To address this issue, we introduce M2I2, a novel framework designed to enhance the agents' capabilities to assimilate and utilize received information effectively. M2I2 equips agents with advanced capabilities for masked state modeling and joint-action prediction, enriching their perception of environmental uncertainties and facilitating the anticipation of teammates' intentions. This approach ensures that agents are furnished with both comprehensive and relevant information, bolstering more informed and synergistic behaviors. Moreover, we propose a Dimensional Rational Network, innovatively trained via a meta-learning paradigm, to identify the importance of dimensional pieces of information, evaluating their contributions to decision-making and auxiliary tasks. Then, we implement an importance-based heuristic for selective information masking and sharing. This strategy optimizes the efficiency of masked state modeling and the rationale behind information sharing. We evaluate M2I2 across diverse multi-agent tasks, the results demonstrate its superior performance, efficiency, and generalization capabilities, over existing state-of-the-art methods in various complex scenarios.

Published

2026-03-14

How to Cite

Sun, C., He, P., Ji, Q., Zang, Z., Li, J., Wang, R., & Wang, W. (2026). M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference. Proceedings of the AAAI Conference on Artificial Intelligence, 40(30), 25672–25681. https://doi.org/10.1609/aaai.v40i30.39764

Issue

Section

AAAI Technical Track on Machine Learning VII