Booth, Serena, W. Bradley Knox, Julie Shah, Scott Niekum, Peter Stone, and Alessandro Allievi. “The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 5 (June 26, 2023): 5920-5929. Accessed July 21, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/25733.