(1) Bai, Q.; Singh Bedi, A.; Aggarwal, V. Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm. AAAI 2023, 37, 6737-6744.