ŠTRUPL, M.; FACCIO, F.; ASHLEY, D. R.; SRIVASTAVA, R. K.; SCHMIDHUBER, J. Reward-Weighted Regression Converges to a Global Optimum. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 36, n. 8, p. 8361-8369, 2022. DOI: 10.1609/aaai.v36i8.20811. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/20811. Accessed: 28 Apr. 2026.