Zhou, Weichao, and Wenchao Li. “Programmatic Reward Design by Example.” Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 8 (June 28, 2022): 9233–9241. Accessed April 23, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20910.