[1]
Chen, Y. et al. 2023. Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence. 37, 1 (Jun. 2023), 396–404. DOI:https://doi.org/10.1609/aaai.v37i1.25113.