(1)
Li, C.; Zhang, Y.; Wang, J.; Hu, Y.; Dong, S.; Li, W.; Lv, T.; Fan, C.; Gao, Y. Optimistic Value Instructors for Cooperative Multi-Agent Reinforcement Learning. AAAI 2024, 38, 17453-17460.