Fan, J. and Chen, C.-M. (2026) “Efficient Multimodal Large Language Model via Dynamic KV Cache Quantization”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), pp. 20994–21001. doi: 10.1609/aaai.v40i25.39241.