[1]
C. Gelada and M. G. Bellemare, “Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift”, AAAI, vol. 33, no. 01, pp. 3647-3655, Jul. 2019.