[1]

Y. Zhang, “A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning”, AAAI, vol. 38, no. 15, pp. 16908–16916, Mar. 2024.