[1] S. Xia, X. Li, Y. Liu, T. Wu, and P. Liu, “Evaluating Mathematical Reasoning Beyond Accuracy”, AAAI, vol. 39, no. 26, pp. 27723–27730, Apr. 2025.