[1]
S. Li, W. Xiao, H. Wu, X. Zhang, D. An, and S. Lü, “State Proficiency-Based Adaptive Fine-Tuning for Offline-to-Online Reinforcement Learning”, AAAI, vol. 40, no. 28, pp. 23169–23176, Mar. 2026.