[1] F. Ma, Y. Zhou, F. Rao, Y. Zhang, and X. Sun, “Image Captioning with Multi-Context Synthetic Data,” AAAI, vol. 38, no. 5, pp. 4089–4097, Mar. 2024.