CHEN, Wentse; HUANG, Shiyu; CHIANG, Yuan; PEARCE, Tim; TU, Wei-Wei; CHEN, Ting; ZHU, Jun. DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 38, n. 10, p. 11390–11398, 2024. DOI: 10.1609/aaai.v38i10.29019. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/29019. Accessed: 14 May 2026.