Ying, D., M. A. Guo, Y. Ding, J. Lavaei, and Z.-J. Shen. “Policy-Based Primal-Dual Methods for Convex Constrained Markov Decision Processes”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 9, June 2023, pp. 10963-71, doi:10.1609/aaai.v37i9.26299.