[1]
B. da Silva and A. Barto, “TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration”, AAAI, vol. 26, no. 1, pp. 886-892, Sep. 2021.