Feng, Jiaheng, Mingxiao Feng, Haolin Song, Wengang Zhou, and Houqiang Li. 2024. “SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (11):11961-69. https://doi.org/10.1609/aaai.v38i11.29083.