Gadot, Uri, Esther Derman, Navdeep Kumar, Maxence Mohamed Elfatihi, Kfir Levy, and Shie Mannor. “Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization”. Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 19 (March 24, 2024): 21090–21098. Accessed May 31, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/30101.