da Silva, B., & Barto, A. (2021). TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration. Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), 886-892. https://doi.org/10.1609/aaai.v26i1.8286