(1)
Chen, Y.; Zhang, X.; Xie, Q.; Zhu, X. Exact Policy Recovery in Offline RL With Both Heavy-Tailed Rewards and Data Corruption. AAAI 2024, 38, 11416-11424.