YOU, Xiaoxing; HUANG, Qiang; LI, Lingyu; ZHANG, Chi; LIU, Xiaopeng; ZHANG, Min; YU, Jun. Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 14, p. 12108–12116, 2026. DOI: 10.1609/aaai.v40i14.38200. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/38200. Acesso em: 27 may. 2026.