Li, J., Ren, T., Yan, D., Su, H., & Zhu, J. (2022). Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), 7417–7425. https://doi.org/10.1609/aaai.v36i7.20705