Cao, S., B. Wang, W. Zhang, and L. Ma. “Visual Consensus Modeling for Video-Text Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 1, June 2022, pp. 167-75, doi:10.1609/aaai.v36i1.19891.