[1]
C. Zhao, X. Zhang, B. Wang, and S. Li, “Logarithmic Regret for Linear Markov Decision Processes with Adversarial Corruptions”, AAAI, vol. 39, no. 21, pp. 22759–22767, Apr. 2025.