Zhang, Z., & Tan, X. (2024). An Implicit Trust Region Approach to Behavior Regularized Offline Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 16944-16952. https://doi.org/10.1609/aaai.v38i15.29637