DELGADO, Tomás; SÁNCHEZ SORONDO, Marco; BRABERMAN, Víctor; UCHITEL, Sebastián. Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach. Proceedings of the International Conference on Automated Planning and Scheduling, [S. l.], v. 33, n. 1, p. 569–577, 2023. DOI: 10.1609/icaps.v33i1.27238. Available at: https://ojs.aaai.org/index.php/ICAPS/article/view/27238. Accessed: 25 May 2026.