(1)
Zhu, W.; Wang, Y.; Li, H.; Zhu, P. VTD-CLIP: Video-to-Text Discretization via Prompting CLIP. AAAI 2026, 40, 13979-13987.