Yang, N. (2026) “Proactive Constrained Policy Optimization with Preemptive Penalty”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), pp. 27583–27591. doi: 10.1609/aaai.v40i32.39978.