Li, Jialian, Tongzheng Ren, Dong Yan, Hang Su, and Jun Zhu. 2022. “Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model.” Proceedings of the AAAI Conference on Artificial Intelligence 36 (7): 7417-25. https://doi.org/10.1609/aaai.v36i7.20705.