(1) Anagnostides, I.; Panageas, I.; Farina, G.; Sandholm, T. Optimistic Policy Gradient in Multi-Player Markov Games With a Single Controller: Convergence Beyond the Minty Property. AAAI 2024, 38, 9451-9459.