[1] H. Tang, “MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval,” AAAI, vol. 39, no. 7, pp. 7238–7246, Apr. 2025.