[1] Dohmen, T., Topper, N., Atia, G., Beckus, A., Trivedi, A. and Velasquez, A. 2022. Inferring Probabilistic Reward Machines from Non-Markovian Reward Signals for Reinforcement Learning. Proceedings of the International Conference on Automated Planning and Scheduling. 32, 1 (Jun. 2022), 574-582. DOI:https://doi.org/10.1609/icaps.v32i1.19844.