Zhang, Zhe, and Xiaoyang Tan. 2024. “An Implicit Trust Region Approach to Behavior Regularized Offline Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (15):16944-52. https://doi.org/10.1609/aaai.v38i15.29637.