1.
De Asis K, Chan A, Pitis S, Sutton R, Graves D. Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning. AAAI [Internet]. 2020Apr.3 [cited 2022Aug.7];34(04):3741-8. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/5784