[1]
S. Booth, W. B. Knox, J. Shah, S. Niekum, P. Stone, and A. Allievi, “The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications”, AAAI, vol. 37, no. 5, pp. 5920-5929, Jun. 2023.