Delgado, T. (2023) “Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach”, Proceedings of the International Conference on Automated Planning and Scheduling, 33(1), pp. 569–577. doi: 10.1609/icaps.v33i1.27238.