Nguyen, Khanh, Ali Furkan Biten, Andres Mafla, Lluis Gomez, and Dimosthenis Karatzas. “Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 2 (June 26, 2023): 1940-1948. Accessed July 12, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/25285.