(1) Liu, J.; He, Y.; Ke, W.; Wang, P.; Shang, Z.; Li, G.; Xu, Z. Balanced Knowledge Distillation for Large Language Models With Mix-of-Experts. AAAI 2026, 40, 23694-23702.