Gelada, C., & Bellemare, M. G. (2019). Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 3647-3655. https://doi.org/10.1609/aaai.v33i01.33013647