[1]
R. Jiang, S. Zhang, V. Chelu, A. White, and H. van Hasselt, “Learning Expected Emphatic Traces for Deep RL”, AAAI, vol. 36, no. 6, pp. 7015-7023, Jun. 2022.