Ghosh, D., Atia, G. K., & Wang, Y. (2026). ORVIT: Near-Optimal Online Distributionally Robust Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), 21278–21286. https://doi.org/10.1609/aaai.v40i25.39273