Li, S., Xiao, W., Wu, H., Zhang, X., An, D., & Lü, S. (2026). State Proficiency-Based Adaptive Fine-Tuning for Offline-to-Online Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(28), 23169–23176. https://doi.org/10.1609/aaai.v40i28.39484