[1]

Neustroev, G., de Weerdt, M. and Verzijlbergh, R. 2021. Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards. Proceedings of the International Conference on Automated Planning and Scheduling. 29, 1 (May 2021), 292-300. DOI:https://doi.org/10.1609/icaps.v29i1.3491.