HasanzadeZonuzy, A., A. Bura, D. Kalathil, and S. Shakkottai. “Learning With Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 9, May 2021, pp. 7667-74, doi:10.1609/aaai.v35i9.16937.