(1)
Dalal, G.; Szorenyi, B.; Thoppe, G. A Tale of Two-Timescale Reinforcement Learning With the Tightest Finite-Time Bound. AAAI 2020, 34, 3701-3708.