[1] O. Ben-Porat, Y. Mansour, M. Moshkovitz, and B. Taitler, “Principal-Agent Reward Shaping in MDPs”, AAAI, vol. 38, no. 9, pp. 9502-9510, Mar. 2024.