(1)

Neustroev, G.; de Weerdt, M.; Verzijlbergh, R. Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes With Unbounded Rewards. ICAPS 2021, 29, 292-300.