[1]

You, X. et al. 2026. Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 14 (Mar. 2026), 12108–12116. DOI:https://doi.org/10.1609/aaai.v40i14.38200.