1. Booth S, Knox WB, Shah J, Niekum S, Stone P, Allievi A. The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications. AAAI [Internet]. 2023 Jun 26 [cited 2024 Jul 21];37(5):5920-9. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/25733