[1]
H. Wei, X. Liu, and L. Ying, “A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes”, AAAI, vol. 36, no. 4, pp. 3868-3876, Jun. 2022.