Chen J, Yang Z, Shi J, Wo T, Tang J. MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning. AAAI [Internet]. 2026 Mar 14 [cited 2026 May 2];40(24):20136-44. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/39100