Ma, Yecheng Jason, Andrew Shen, Osbert Bastani, and Jayaraman Dinesh. 2022. “Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 36 (5):5404-12. https://doi.org/10.1609/aaai.v36i5.20478.