[1]
Cui, X., Zhu, M., Qin, Y., Xie, L., Zhou, W. and Li, H. 2025. Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models. Proceedings of the AAAI Conference on Artificial Intelligence. 39, 22 (Apr. 2025), 23724-23732. DOI:https://doi.org/10.1609/aaai.v39i22.34543.