[1]
F. Z. Dai and M. R. Walter, “Loop Estimator for Discounted Values in Markov Reward Processes”, AAAI, vol. 35, no. 8, pp. 7169-7175, May 2021.