[1]
Badger, K., Huang, J. and Petrik, M. 2026. Convergence of Fast Policy Iteration in Markov Games and Robust MDPs. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 24 (Mar. 2026), 19649-19656. DOI:https://doi.org/10.1609/aaai.v40i24.39045.