Cao, Shuqiang, Bairui Wang, Wei Zhang, and Lin Ma. “Visual Consensus Modeling for Video-Text Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 1 (June 28, 2022): 167–175. Accessed May 31, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/19891.