TIAN, K.; CHENG, Y.; LIU, Y.; HOU, X.; CHEN, Q.; LI, H. Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 38, n. 6, p. 5207-5214, 2024. DOI: 10.1609/aaai.v38i6.28327. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/28327. Acesso em: 14 oct. 2024.