[1]
X. Fan, Z. Sun, T. Ji, L. Shen, and T. Gui, “MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention Across Vision-Language Models”, AAAI, vol. 40, no. 36, pp. 30638-30646, Mar. 2026.