[1] A. Hüyük, W. R. Zame, and M. van der Schaar, “Inferring Lexicographically-Ordered Rewards from Preferences,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 5, pp. 5737-5745, Jun. 2022.