Cui, Xiao, Mo Zhu, Yulei Qin, Liang Xie, Wengang Zhou, and Houqiang Li. “Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models.” Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 22 (April 11, 2025): 23724–23732. Accessed April 29, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/34543.