Xia, Shijie, Xuefeng Li, Yixin Liu, Tongshuang Wu, and Pengfei Liu. 2025. “Evaluating Mathematical Reasoning Beyond Accuracy.” Proceedings of the AAAI Conference on Artificial Intelligence 39 (26): 27723–30. https://doi.org/10.1609/aaai.v39i26.34987.