(1)
Gu, S.; Sel, B.; Ding, Y.; Wang, L.; Lin, Q.; Jin, M.; Knoll, A. Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation. AAAI 2024, 38, 21099-21106.