Delgrange, F., Nowé, A., & Pérez, G. A. (2022). Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes. Proceedings of the AAAI Conference on Artificial Intelligence, 36(6), 6497-6505. https://doi.org/10.1609/aaai.v36i6.20602