[1]
K. Nguyen, A. F. Biten, A. Mafla, L. Gomez, and D. Karatzas, “Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia”, AAAI, vol. 37, no. 2, pp. 1940-1948, Jun. 2023.