Cao, Jiajun, et al. “Augmenting Intra-Modal Understanding in MLLMs for Robust Multimodal Keyphrase Generation.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 17, Mar. 2026, pp. 14511-19, doi:10.1609/aaai.v40i17.38468.