Gelada, Carles, and Marc G. Bellemare. “Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift”. Proceedings of the AAAI Conference on Artificial Intelligence 33, no. 01 (July 17, 2019): 3647-3655. Accessed April 23, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/4246.