(1)
Ben-Porat, O.; Mansour, Y.; Moshkovitz, M.; Taitler, B. Principal-Agent Reward Shaping in MDPs. AAAI 2024, 38, 9502-9510.