(1) Bai, Q.; Bedi, A. S.; Agarwal, M.; Koppel, A.; Aggarwal, V. Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach. AAAI 2022, 36, 3682-3689.