Li, Z., Z. Wei, Z. Fan, H. Shan, and X. Huang. “An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 15, May 2021, pp. 13324-32, doi:10.1609/aaai.v35i15.17573.