[1]
Dalal, G., Szorenyi, B. and Thoppe, G. 2020. A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound. Proceedings of the AAAI Conference on Artificial Intelligence. 34, 04 (Apr. 2020), 3701-3708. DOI:https://doi.org/10.1609/aaai.v34i04.5779.