[1]
T. Dohmen, N. Topper, G. Atia, A. Beckus, A. Trivedi, and A. Velasquez, “Inferring Probabilistic Reward Machines from Non-Markovian Reward Signals for Reinforcement Learning”, ICAPS, vol. 32, no. 1, pp. 574-582, Jun. 2022.