[1] H. Zhang, Z. Mao, K. Zhang, and Y. Zhang, “Show Your Faith: Cross-Modal Confidence-Aware Network for Image-Text Matching,” AAAI, vol. 36, no. 3, pp. 3262–3270, Jun. 2022.