1.
Zhang Z, Tan X. An Implicit Trust Region Approach to Behavior Regularized Offline Reinforcement Learning. AAAI [Internet]. 2024Mar.24 [cited 2024Jul.16];38(15):16944-52. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/29637