Tang, H., Cao, M., Huang, J., Liu, R., Jin, P., Li, G., & Liang, X. (2025). MUSE: Mamba Is Efficient Multi-scale Learner for Text-video Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 39(7), 7238–7246. https://doi.org/10.1609/aaai.v39i7.32778