Zhai, Y., Li, Y., Gao, Z., Gong, X., Xu, K., Feng, D., … Wang, H. (2024). Optimistic Model Rollouts for Pessimistic Offline Policy Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 16678–16686. https://doi.org/10.1609/aaai.v38i15.29607