Fan, Jiahao, and Chien-Ming Chen. “Efficient Multimodal Large Language Model via Dynamic KV Cache Quantization”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 25, Mar. 2026, pp. 20994-01, doi:10.1609/aaai.v40i25.39241.