(1)
Chen, Y.; Wang, J.; Lin, L.; Qi, Z.; Ma, J.; Shan, Y. Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval. AAAI 2023, 37, 396-404.