Yao, L., Wang, W., & Jin, Q. (2022). Image Difference Captioning with Pre-training and Contrastive Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3108-3116. https://doi.org/10.1609/aaai.v36i3.20218