Li, J., Ren, T., Yan, D., Su, H. and Zhu, J. (2022) “Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model”, Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), pp. 7417-7425. doi: 10.1609/aaai.v36i7.20705.