1. You X, Huang Q, Li L, Zhang C, Liu X, Zhang M, et al. Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning. AAAI [Internet]. 2026 Mar 14 [cited 2026 May 27];40(14):12108-16. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/38200