LIU, Jiajun; HE, Yao; KE, Wenjun; WANG, Peng; SHANG, Ziyu; LI, Guozheng; XU, Zijie. Balanced Knowledge Distillation for Large Language Models with Mix-of-Experts. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 28, p. 23694–23702, 2026. DOI: 10.1609/aaai.v40i28.39543. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/39543. Acesso em: 15 may. 2026.