Delgado, T., Sánchez Sorondo, M., Braberman, V., & Uchitel, S. (2023). Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach. Proceedings of the International Conference on Automated Planning and Scheduling, 33(1), 569-577. https://doi.org/10.1609/icaps.v33i1.27238