Dai, F. Z. and Walter, M. R. (2021) “Loop Estimator for Discounted Values in Markov Reward Processes”, Proceedings of the AAAI Conference on Artificial Intelligence, 35(8), pp. 7169-7175. doi: 10.1609/aaai.v35i8.16881.