Wei, Honghao, Xin Liu, and Lei Ying. “A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 4 (June 28, 2022): 3868-3876. Accessed June 21, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20302.