[1]
Weng, P. 2011. Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences. Proceedings of the International Conference on Automated Planning and Scheduling. 21, 1 (Mar. 2011), 282-289. DOI:https://doi.org/10.1609/icaps.v21i1.13448.