(1)
Dai, F. Z.; Walter, M. R. Loop Estimator for Discounted Values in Markov Reward Processes. AAAI 2021, 35, 7169-7175.