Wang, Siqi, Zhengyu Chen, Teng Xiao, Zheqi Lv, Jinluan Yang, Xunliang Cai, Jingang Wang, and Xiaomeng Li. “Scaling and Transferability of Annealing Strategies in Large Language Model Training”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 40 (March 14, 2026): 33639–33647. Accessed May 14, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/40653.