Cao, Jiajun, Qinggang Zhang, Yunbo Tang, Zhishang Xiang, Chang Yang, and Jinsong Su. “Augmenting Intra-Modal Understanding in MLLMs for Robust Multimodal Keyphrase Generation”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 17 (March 14, 2026): 14511–14519. Accessed May 18, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/38468.