(1)
Li, S.; Xiao, W.; Wu, H.; Zhang, X.; An, D.; Lü, S. State Proficiency-Based Adaptive Fine-Tuning for Offline-to-Online Reinforcement Learning. AAAI 2026, 40, 23169-23176.