[1]
Y. Zhang, “A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning”, AAAI, vol. 38, no. 15, pp. 16908-16916, Mar. 2024.