PADiff: Predictive and Adaptive Diffusion Policies for Ad Hoc Teamwork

Hohei Chan; Xinzhi Zhang; Antao Xiang; Weinan Zhang; Mengchen Zhao

doi:10.1609/aaai.v40i24.39078

Authors

Hohei Chan, South China University of Technology
Xinzhi Zhang, South China University of Technology
Antao Xiang, South China University of Technology
Weinan Zhang, Shanghai Jiao Tong University
Mengchen Zhao, South China University of Technology

DOI:

https://doi.org/10.1609/aaai.v40i24.39078

Abstract

Ad hoc teamwork (AHT) requires agents to collaborate with previously unseen teammates, which is crucial for many real-world applications. The core challenge of AHT is to develop an ego agent that can predict and adapt to unknown teammates on the fly. Conventional RL-based approaches optimize a single expected return, which often causes policies to collapse into a single dominant behavior and thus fail to capture the multimodal cooperation patterns inherent in AHT. In this work, we introduce PADiff, a diffusion-based approach that captures an agent's multimodal behaviors, unlocking its diverse cooperation modes with teammates. However, standard diffusion models lack the ability to predict and adapt in non-stationary AHT scenarios. To address this limitation, we propose a novel diffusion-based policy that integrates critical predictive information about teammates into the denoising process. Extensive experiments across three environments demonstrate that PADiff significantly outperforms existing AHT methods.
