1.
Ye Y, Zhou X, Chen Z, Li D, Gu H, Zhou JP, et al. K-12EduBench: A Benchmark for Evaluating Large Language Models’ Knowledge, Problem-Solving, and Educational Goal Cognition in K-12 Education. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 25];40(40):34459-66. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/40744