1.
Zhou Q, Li H, Wang J. Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization. AAAI [Internet]. 2020Apr.3 [cited 2024Mar.28];34(04):6941-8. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/6177