HasanzadeZonuzy, A., Bura, A., Kalathil, D., & Shakkottai, S. (2021). Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs. Proceedings of the AAAI Conference on Artificial Intelligence, 35(9), 7667-7674. https://doi.org/10.1609/aaai.v35i9.16937