NEUSTROEV, G.; DE WEERDT, M.; VERZIJLBERGH, R. Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards. Proceedings of the International Conference on Automated Planning and Scheduling, [S. l.], v. 29, n. 1, p. 292-300, 2021. DOI: 10.1609/icaps.v29i1.3491. Disponível em: https://ojs.aaai.org/index.php/ICAPS/article/view/3491. Acesso em: 23 apr. 2024.