(1) Nemani, H.; Garimella, K. Large-Scale Multimodal Content Analysis and Annotation With Vision-Language Models. ICWSM 2026, 20, 1676-1699.