Sumers, T. R., M. K. Ho, R. D. Hawkins, K. Narasimhan, and T. L. Griffiths. “Learning Rewards From Linguistic Feedback”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 7, May 2021, pp. 6002-10, doi:10.1609/aaai.v35i7.16749.