Cui, X., Zhu, M., Qin, Y., Xie, L., Zhou, W., & Li, H. (2025). Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), 23724-23732. https://doi.org/10.1609/aaai.v39i22.34543