da Silva, Bruno, and Andrew Barto. 2021. “TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration”. Proceedings of the AAAI Conference on Artificial Intelligence 26 (1):886-92. https://doi.org/10.1609/aaai.v26i1.8286.