Zhang, H., Lin, Y., Shen, S., Han, S., & Lv, K. (2024). Enhancing Off-Policy Constrained Reinforcement Learning through Adaptive Ensemble C Estimation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(19), 21770-21778. https://doi.org/10.1609/aaai.v38i19.30177