Delgrange, Florent, Ann Nowé, and Guillermo A. Pérez. “Distillation of RL Policies With Formal Guarantees via Variational Abstraction of Markov Decision Processes.” Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 6 (June 28, 2022): 6497–6505. Accessed April 25, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20602.