1. Liu W, Huo L, Jing Y, Zhang X, Xie J. MRACL: Multi-Reward Space Guided Adaptive Curriculum Reinforcement Learning for LLMs. AAAI [Internet]. 2026 Mar 14 [cited 2026 May 17];40(44):37663-72. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/41101