(1) Dai, J.; Ji, J.; Yang, L.; Zheng, Q.; Pan, G. Augmented Proximal Policy Optimization for Safe Reinforcement Learning. AAAI 2023, 37, 7288-7295.