(1)
Ye, Y.; Zhou, X.; Chen, Z.; Li, D.; Gu, H.; Zhou, J. P.; Zhou, D. K-12EduBench: A Benchmark for Evaluating Large Language Models’ Knowledge, Problem-Solving, and Educational Goal Cognition in K-12 Education. AAAI 2026, 40, 34459-34466.