[1] U. Gadot, E. Derman, N. Kumar, M. M. Elfatihi, K. Levy, and S. Mannor, “Solving Non-rectangular Reward-Robust MDPs via Frequency Regularization,” AAAI, vol. 38, no. 19, pp. 21090–21098, Mar. 2024.