1.
Dohmen T, Topper N, Atia G, Beckus A, Trivedi A, Velasquez A. Inferring Probabilistic Reward Machines from Non-Markovian Reward Signals for Reinforcement Learning. ICAPS [Internet]. 2022Jun.13 [cited 2024Apr.19];32(1):574-82. Available from: https://ojs.aaai.org/index.php/ICAPS/article/view/19844