Corazza, J., Gavran, I., & Neider, D. (2022). Reinforcement Learning with Stochastic Reward Machines. Proceedings of the AAAI Conference on Artificial Intelligence, 36(6), 6429-6436. https://doi.org/10.1609/aaai.v36i6.20594