(1)

You, X.; Huang, Q.; Li, L.; Zhang, C.; Liu, X.; Zhang, M.; Yu, J. Knowledge Completes the Vision: A Multimodal Entity-Aware Retrieval-Augmented Generation Framework for News Image Captioning. AAAI 2026, 40, 12108-12116.