Deb, Rohan, and Shalabh Bhatnagar. 2022. “Gradient Temporal Difference With Momentum: Stability and Convergence”. Proceedings of the AAAI Conference on Artificial Intelligence 36 (6):6488-96. https://doi.org/10.1609/aaai.v36i6.20601.