Chen, Wentse, Shiyu Huang, Yuan Chiang, Tim Pearce, Wei-Wei Tu, Ting Chen, and Jun Zhu. 2024. “DGPO: Discovering Multiple Strategies With Diversity-Guided Policy Optimization”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (10):11390-98. https://doi.org/10.1609/aaai.v38i10.29019.