(1)

Fan, J.; Chen, C.-M. Efficient Multimodal Large Language Model via Dynamic KV Cache Quantization. AAAI 2026, 40, 20994-21001.