1. Chen Y, Wang J, Lin L, Qi Z, Ma J, Shan Y. Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval. AAAI [Internet]. 2023 Jun. 26 [cited 2026 May 25];37(1):396-404. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/25113