Li, Chao, Yupeng Zhang, Jianqi Wang, Yujing Hu, Shaokang Dong, Wenbin Li, Tangjie Lv, Changjie Fan, and Yang Gao. 2024. “Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning.” Proceedings of the AAAI Conference on Artificial Intelligence 38 (16): 17453-60. https://doi.org/10.1609/aaai.v38i16.29694.