[1]

Zhu, T., Qiu, Y., Zhou, H. and Li, J. 2024. Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence. 38, 15 (Mar. 2024), 17202-17210. DOI:https://doi.org/10.1609/aaai.v38i15.29666.