[1] N. Yang, P. Wang, G. Liu, H. Zhang, P. Lyu, and J. Wang, “Proactive Constrained Policy Optimization with Preemptive Penalty,” AAAI, vol. 40, no. 32, pp. 27583–27591, Mar. 2026.