(1) Zhu, T.; Qiu, Y.; Zhou, H.; Li, J. Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning. AAAI 2024, 38, 17202-17210.