Cao, J., Zhang, Q., Tang, Y., Xiang, Z., Yang, C., & Su, J. (2026). Augmenting Intra-Modal Understanding in MLLMs for Robust Multimodal Keyphrase Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(17), 14511–14519. https://doi.org/10.1609/aaai.v40i17.38468