Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning

Authors

  • Yangkun Chen Shenzhen International Graduate School, Tsinghua University
  • Kai Yang Shenzhen International Graduate School, Tsinghua University
  • Jian Tao Shenzhen International Graduate School, Tsinghua University
  • Jiafei Lyu Shenzhen International Graduate School, Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v39i15.33749

Abstract

Recently, deep Multi-Agent Reinforcement Learning (MARL) has demonstrated its potential to tackle complex cooperative tasks, pushing the boundaries of AI in collaborative environments. However, the efficiency of these systems is often compromised by inadequate sample utilization and a lack of diversity in learning strategies. To enhance MARL performance, we introduce a novel sample reuse approach that dynamically adjusts policy updates based on observation novelty. Specifically, we employ a Random Network Distillation (RND) network to gauge the novelty of each agent's current state, assigning additional sample update opportunities based on the uniqueness of the data. We name our method Multi-Agent Novelty-GuidEd sample Reuse (MANGER). This method increases sample efficiency while promoting exploration and diverse agent behaviors. Our evaluations confirm substantial improvements in MARL effectiveness in complex cooperative scenarios such as Google Research Football and super-hard StarCraft II micromanagement tasks.

Downloads

Published

2025-04-11

How to Cite

Chen, Y., Yang, K., Tao, J., & Lyu, J. (2025). Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 39(15), 15930–15938. https://doi.org/10.1609/aaai.v39i15.33749

Issue

Section

AAAI Technical Track on Machine Learning I