Gelada, Carles, and Marc G. Bellemare. 2019. “Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift”. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01):3647-55. https://doi.org/10.1609/aaai.v33i01.33013647.