Neustroev, Grigory, Mathijs de Weerdt, and Remco Verzijlbergh. 2021. “Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes With Unbounded Rewards”. Proceedings of the International Conference on Automated Planning and Scheduling 29 (1):292-300. https://doi.org/10.1609/icaps.v29i1.3491.