Corazza, J., Gavran, I. and Neider, D. (2022) “Reinforcement Learning with Stochastic Reward Machines”, Proceedings of the AAAI Conference on Artificial Intelligence, 36(6), pp. 6429-6436. doi: 10.1609/aaai.v36i6.20594.