Zhang, H., Y. Lin, S. Shen, S. Han, and K. Lv. “Enhancing Off-Policy Constrained Reinforcement Learning through Adaptive Ensemble C Estimation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 19, Mar. 2024, pp. 21770-21778, doi:10.1609/aaai.v38i19.30177.