LOFTIN, R.; MACGLASHAN, J.; PENG, B.; TAYLOR, M.; LITTMAN, M.; HUANG, J.; ROBERTS, D. A Strategy-Aware Technique for Learning Behaviors from Discrete Human Feedback. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 28, n. 1, 2014. DOI: 10.1609/aaai.v28i1.8839. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/8839. Acesso em: 24 may. 2024.