Ma, Y. J., Shen, A., Bastani, O., & Dinesh, J. (2022). Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(5), 5404–5412. https://doi.org/10.1609/aaai.v36i5.20478