1.
Gelada C, Bellemare MG. Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift. AAAI [Internet]. 2019Jul.17 [cited 2024Mar.28];33(01):3647-55. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/4246