Zhu, W. (2026) “VTD-CLIP: Video-to-Text Discretization via Prompting CLIP”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), pp. 13979–13987. doi: 10.1609/aaai.v40i16.38408.