Yan, R. et al. (2023) “Video-Text Pre-training with Learned Regions for Retrieval”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(3), pp. 3100–3108. doi: 10.1609/aaai.v37i3.25414.