SUMERS, Theodore R.; HO, Mark K.; HAWKINS, Robert D.; NARASIMHAN, Karthik; GRIFFITHS, Thomas L. Learning Rewards From Linguistic Feedback. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n. 7, p. 6002–6010, 2021. DOI: 10.1609/aaai.v35i7.16749. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/16749. Acesso em: 26 may. 2026.