(1) Tang, H.; Cao, M.; Huang, J.; Liu, R.; Jin, P.; Li, G.; Liang, X. MUSE: Mamba Is Efficient Multi-Scale Learner for Text-Video Retrieval. AAAI 2025, 39, 7238-7246.