[1] F. Delgrange, A. Nowé, and G. A. Pérez, “Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes,” AAAI, vol. 36, no. 6, pp. 6497–6505, Jun. 2022.