CHEN, W.; LI, W.; LIU, X.; YANG, S.; GAO, Y. Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 37, n. 10, p. 11542-11550, 2023. DOI: 10.1609/aaai.v37i10.26364. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/26364. Accessed: 29 Apr. 2026.