DELGADO, T.; SÁNCHEZ SORONDO, M.; BRABERMAN, V.; UCHITEL, S. Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach. Proceedings of the International Conference on Automated Planning and Scheduling, [S. l.], v. 33, n. 1, p. 569-577, 2023. DOI: 10.1609/icaps.v33i1.27238. Disponível em: https://ojs.aaai.org/index.php/ICAPS/article/view/27238. Acesso em: 29 jul. 2024.