Lancewicki, Tal, Aviv Rosenberg, and Yishay Mansour. “Learning Adversarial Markov Decision Processes With Delayed Feedback.” Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 7 (June 28, 2022): 7281–7289. Accessed July 20, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20690.