(1)
Wei, H.; Liu, X.; Ying, L. A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes. AAAI 2022, 36, 3868-3876.