Štrupl, Miroslav, Francesco Faccio, Dylan R. Ashley, Rupesh Kumar Srivastava, and Jürgen Schmidhuber. 2022. “Reward-Weighted Regression Converges to a Global Optimum.” Proceedings of the AAAI Conference on Artificial Intelligence 36 (8): 8361–69. https://doi.org/10.1609/aaai.v36i8.20811.