WENG, P. Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences. Proceedings of the International Conference on Automated Planning and Scheduling, [S. l.], v. 21, n. 1, p. 282-289, 2011. DOI: 10.1609/icaps.v21i1.13448. Disponível em: https://ojs.aaai.org/index.php/ICAPS/article/view/13448. Acesso em: 3 oct. 2024.