Tang, Haoran, Meng Cao, Jinfa Huang, Ruyang Liu, Peng Jin, Ge Li, and Xiaodan Liang. “MUSE: Mamba Is Efficient Multi-Scale Learner for Text-Video Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 7 (April 11, 2025): 7238–7246. Accessed May 10, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32778.