Chen, Yiding, Xuezhou Zhang, Qiaomin Xie, and Xiaojin Zhu. 2024. “Exact Policy Recovery in Offline RL With Both Heavy-Tailed Rewards and Data Corruption.” Proceedings of the AAAI Conference on Artificial Intelligence 38 (10): 11416–24. https://doi.org/10.1609/aaai.v38i10.29022.