(1)
da Silva, B.; Barto, A. TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration. AAAI 2021, 26, 886-892.