Gelada C, Bellemare MG. Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift. AAAI [Internet]. 2019 Jul. 17 [cited 2026 May 19];33(01):3647-55. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/4246