Dai, Falcon Z., and Matthew R. Walter. 2021. “Loop Estimator for Discounted Values in Markov Reward Processes.” Proceedings of the AAAI Conference on Artificial Intelligence 35 (8): 7169–75. https://doi.org/10.1609/aaai.v35i8.16881.