Neustroev, G., de Weerdt, M. and Verzijlbergh, R. (2021) “Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards”, Proceedings of the International Conference on Automated Planning and Scheduling, 29(1), pp. 292-300. doi: 10.1609/icaps.v29i1.3491.