Dalal G, Szorenyi B, Thoppe G. A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound. AAAI [Internet]. 2020Apr.3 [cited 2024Apr.20];34(04):3701-8. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/5779