Xiong, H., T. Xu, Y. Liang, and W. Zhang. “Non-Asymptotic Convergence of Adam-Type Reinforcement Learning Algorithms under Markovian Sampling”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 12, May 2021, pp. 10460-8, doi:10.1609/aaai.v35i12.17252.