Dai, J., Ji, J., Yang, L., Zheng, Q., & Pan, G. (2023). Augmented Proximal Policy Optimization for Safe Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 7288-7295. https://doi.org/10.1609/aaai.v37i6.25888