[1]
W. Zhou and W. Li, “Programmatic Reward Design by Example”, AAAI, vol. 36, no. 8, pp. 9233–9241, Jun. 2022.