LI, Chao; ZHANG, Yupeng; WANG, Jianqi; HU, Yujing; DONG, Shaokang; LI, Wenbin; LV, Tangjie; FAN, Changjie; GAO, Yang. Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 38, n. 16, p. 17453–17460, 2024. DOI: 10.1609/aaai.v38i16.29694. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/29694. Accessed: 15 May 2026.