(1) Delgrange, F.; Nowé, A.; Pérez, G. A. Distillation of RL Policies With Formal Guarantees via Variational Abstraction of Markov Decision Processes. AAAI 2022, 36, 6497-6505.