DOHMEN, T.; TOPPER, N.; ATIA, G.; BECKUS, A.; TRIVEDI, A.; VELASQUEZ, A. Inferring Probabilistic Reward Machines from Non-Markovian Reward Signals for Reinforcement Learning. Proceedings of the International Conference on Automated Planning and Scheduling, [S. l.], v. 32, n. 1, p. 574-582, 2022. DOI: 10.1609/icaps.v32i1.19844. Disponível em: https://ojs.aaai.org/index.php/ICAPS/article/view/19844. Acesso em: 19 apr. 2024.