Liu, J., He, Y., Ke, W., Wang, P., Shang, Z., Li, G., & Xu, Z. (2026). Balanced Knowledge Distillation for Large Language Models with Mix-of-Experts. Proceedings of the AAAI Conference on Artificial Intelligence, 40(28), 23694–23702. https://doi.org/10.1609/aaai.v40i28.39543