[1]

R. Li, H. Huang, F. Wei, F. Xiong, Y. Wang, and X. Chu, “AdaCuRL: Adaptive Curriculum Reinforcement Learning with Invalid Sample Mitigation and Historical Revisiting”, AAAI, vol. 40, no. 27, pp. 23123–23131, Mar. 2026.