Neustroev, Grigory, Mathijs de Weerdt, and Remco Verzijlbergh. “Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes With Unbounded Rewards”. Proceedings of the International Conference on Automated Planning and Scheduling 29, no. 1 (May 25, 2021): 292-300. Accessed April 23, 2024. https://ojs.aaai.org/index.php/ICAPS/article/view/3491.