(1)
Dabney, W.; Barto, A. Adaptive Step-Size for Online Temporal Difference Learning. AAAI 2012, 26, 872-878.