Fang, X. (2024) “Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language”, Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), pp. 1735–1743. doi: 10.1609/aaai.v38i2.27941.