1.
Xiong H, Xu T, Liang Y, Zhang W. Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling. AAAI [Internet]. 2021May18 [cited 2022Jan.18];35(12):10460-8. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/17252