Zhou, Qi, HouQiang Li, and Jie Wang. “Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization.” Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (April 3, 2020): 6941–6948. Accessed April 24, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/6177.