Zhou, Q., Li, H., & Wang, J. (2020). Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization. Proceedings of the AAAI Conference on Artificial Intelligence, 34(04), 6941-6948. https://doi.org/10.1609/aaai.v34i04.6177