Xiong, H., Xu, T., Liang, Y. and Zhang, W. (2021) “Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling”, Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), pp. 10460-10468. doi: 10.1609/aaai.v35i12.17252.