Xiong, H., Xu, T., Liang, Y. and Zhang, W. . (2021) “Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling”, Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), pp. 10460-10468. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/17252 (Accessed: 22January2022).