(1)
Chen, W.; Li, W.; Liu, X.; Yang, S.; Gao, Y. Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient. AAAI 2023, 37, 11542-11550.