Weng, P. (2011). Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences. Proceedings of the International Conference on Automated Planning and Scheduling, 21(1), 282-289. https://doi.org/10.1609/icaps.v21i1.13448