1.
Li S, Xiao W, Wu H, Zhang X, An D, Lü S. State Proficiency-Based Adaptive Fine-Tuning for Offline-to-Online Reinforcement Learning. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 14];40(28):23169-76. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/39484