[1]
D. Zeng, Y. Shen, M. Lin, Z. Yi, and J. Ouyang, “Zero-Shot Image Captioning with Multi-type Entity Representations”, AAAI, vol. 39, no. 21, pp. 22308–22316, Apr. 2025.