Delgado, T., M. Sánchez Sorondo, V. Braberman, and S. Uchitel. “Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach”. Proceedings of the International Conference on Automated Planning and Scheduling, vol. 33, no. 1, July 2023, pp. 569-77, doi:10.1609/icaps.v33i1.27238.