[1]
Y. Zhai, “Optimistic Model Rollouts for Pessimistic Offline Policy Optimization”, AAAI, vol. 38, no. 15, pp. 16678–16686, Mar. 2024.