Liu, Z., Liu, J. and Ma, F. (2024) “Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), pp. 3864–3872. doi: 10.1609/aaai.v38i4.28178.