(1)

Zhu, W.; Wang, Y.; Li, H.; Zhu, P. VTD-CLIP: Video-to-Text Discretization via Prompting CLIP. AAAI 2026, 40, 13979-13987.