Dabney, W. and Barto, A. (2012) “Adaptive Step-Size for Online Temporal Difference Learning”, Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), pp. 872-878. doi: 10.1609/aaai.v26i1.8313.