Weng, Paul. “Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences.” Proceedings of the International Conference on Automated Planning and Scheduling 21, no. 1 (March 22, 2011): 282–289. Accessed April 28, 2024. https://ojs.aaai.org/index.php/ICAPS/article/view/13448.