1.
Neustroev G, de Weerdt M, Verzijlbergh R. Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards. ICAPS [Internet]. 2021May25 [cited 2024Jul.18];29(1):292-300. Available from: https://ojs.aaai.org/index.php/ICAPS/article/view/3491