da Silva, B., and A. Barto. “TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 26, no. 1, Sept. 2021, pp. 886-92, doi:10.1609/aaai.v26i1.8286.