[1]
Zhao, C. et al. 2025. Logarithmic Regret for Linear Markov Decision Processes with Adversarial Corruptions. Proceedings of the AAAI Conference on Artificial Intelligence. 39, 21 (Apr. 2025), 22759–22767. DOI:https://doi.org/10.1609/aaai.v39i21.34436.