Corazza, Jan, Ivan Gavran, and Daniel Neider. 2022. “Reinforcement Learning With Stochastic Reward Machines”. Proceedings of the AAAI Conference on Artificial Intelligence 36 (6):6429-36. https://doi.org/10.1609/aaai.v36i6.20594.