Neustroev, G., de Weerdt, M., & Verzijlbergh, R. (2021). Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards. Proceedings of the International Conference on Automated Planning and Scheduling, 29(1), 292-300. https://doi.org/10.1609/icaps.v29i1.3491