Delgado, Tomás, Marco Sánchez Sorondo, Víctor Braberman, and Sebastián Uchitel. “Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach”. Proceedings of the International Conference on Automated Planning and Scheduling 33, no. 1 (July 1, 2023): 569-577. Accessed September 3, 2024. https://ojs.aaai.org/index.php/ICAPS/article/view/27238.