(1)
Liu, W.; Huo, L.; Jing, Y.; Zhang, X.; Xie, J. MRACL: Multi-Reward Space Guided Adaptive Curriculum Reinforcement Learning for LLMs. AAAI 2026, 40, 37663-37672.