Delgado, T., Sánchez Sorondo, M., Braberman, V. and Uchitel, S. (2023) “Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach”, Proceedings of the International Conference on Automated Planning and Scheduling, 33(1), pp. 569-577. doi: 10.1609/icaps.v33i1.27238.