Zhu, T., Qiu, Y., Zhou, H., & Li, J. (2024). Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 17202-17210. https://doi.org/10.1609/aaai.v38i15.29666