Yan, Rui, Mike Zheng Shou, Yixiao Ge, Jinpeng Wang, Xudong Lin, Guanyu Cai, and Jinhui Tang. 2023. “Video-Text Pre-Training With Learned Regions for Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence 37 (3):3100-3108. https://doi.org/10.1609/aaai.v37i3.25414.