1.
Weng P. Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences. ICAPS [Internet]. 2011 Mar. 22 [cited 2026 May 26];21(1):282-9. Available from: https://ojs.aaai.org/index.php/ICAPS/article/view/13448