BAI, Qinbo; BEDI, Amrit Singh; AGARWAL, Mridul; KOPPEL, Alec; AGGARWAL, Vaneet. Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 36, n. 4, p. 3682–3689, 2022. DOI: 10.1609/aaai.v36i4.20281. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/20281. Accessed on: 25 May 2026.