[1]
W. Liu, L. Huo, Y. Jing, X. Zhang, and J. Xie, “MRACL: Multi-Reward Space Guided Adaptive Curriculum Reinforcement Learning for LLMs”, AAAI, vol. 40, no. 44, pp. 37663–37672, Mar. 2026.