[1]
A. HasanzadeZonuzy, A. Bura, D. Kalathil, and S. Shakkottai, “Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs”, AAAI, vol. 35, no. 9, pp. 7667-7674, May 2021.