Lancewicki, Tal, Aviv Rosenberg, and Yishay Mansour. 2022. “Learning Adversarial Markov Decision Processes With Delayed Feedback”. Proceedings of the AAAI Conference on Artificial Intelligence 36 (7):7281-89. https://doi.org/10.1609/aaai.v36i7.20690.