[1] J. Feng, M. Feng, H. Song, W. Zhou, and H. Li, “SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning,” AAAI, vol. 38, no. 11, pp. 11961–11969, Mar. 2024.