Weng, P. (2011). Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences. Proceedings of the International Conference on Automated Planning and Scheduling, 21(1), 282–289. https://doi.org/10.1609/icaps.v21i1.13448