Xiong, H., Xu, T., Liang, Y., & Zhang, W. . (2021). Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling. Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), 10460-10468. https://doi.org/10.1609/aaai.v35i12.17252