(1)
Zhao, C.; Zhang, X.; Wang, B.; Li, S. Logarithmic Regret for Linear Markov Decision Processes With Adversarial Corruptions. AAAI 2025, 39, 22759-22767.