(1)
HasanzadeZonuzy, A.; Bura, A.; Kalathil, D.; Shakkottai, S. Learning With Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs. AAAI 2021, 35, 7667-7674.