[1]

M. Li, X. Shi, H. Leng, W. Zhou, H.-T. Zheng, and K. Zhang, “Learning Semantic Alignment with Global Modality Reconstruction for Video-Language Pre-training towards Retrieval”, AAAI, vol. 37, no. 1, pp. 1377-1385, Jun. 2023.