Sun, Yiliu, Zicheng Zhao, Yang Wei, Yanfang Zhang, and Chen Gong. “Well Begun, Half Done: Reinforcement Learning With Prefix Optimization for LLM Reasoning”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 39 (March 14, 2026): 33144–33152. Accessed May 16, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/40598.