Cao, S., Wang, B., Zhang, W., & Ma, L. (2022). Visual Consensus Modeling for Video-Text Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 36(1), 167-175. https://doi.org/10.1609/aaai.v36i1.19891