1.
Zhang Y, Liu J, Li C, Niu Y, Yang Y, Liu Y, Ouyang W. A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning. AAAI [Internet]. 2024Mar.24 [cited 2024Jun.27];38(15):16908-16. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/29633