Štrupl, M., Faccio, F., Ashley, D. R., Srivastava, R. K., & Schmidhuber, J. (2022). Reward-Weighted Regression Converges to a Global Optimum. Proceedings of the AAAI Conference on Artificial Intelligence, 36(8), 8361-8369. https://doi.org/10.1609/aaai.v36i8.20811