Liu, Jiajun, Yao He, Wenjun Ke, Peng Wang, Ziyu Shang, Guozheng Li, and Zijie Xu. “Balanced Knowledge Distillation for Large Language Models With Mix-of-Experts”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 28 (March 14, 2026): 23694–23702. Accessed May 15, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/39543.