Yan, Rui, et al. “Video-Text Pre-Training With Learned Regions for Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 3, June 2023, pp. 3100-8, doi:10.1609/aaai.v37i3.25414.