[1] J. Li, T. Ren, D. Yan, H. Su, and J. Zhu, “Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model”, AAAI, vol. 36, no. 7, pp. 7417–7425, Jun. 2022.