(1) Ghosh, D.; Atia, G. K.; Wang, Y. ORVIT: Near-Optimal Online Distributionally Robust Reinforcement Learning. AAAI 2026, 40, 21278-21286.