Nemani, Harsha, and Kiran Garimella. 2026. “Large-Scale Multimodal Content Analysis and Annotation With Vision-Language Models”. Proceedings of the International AAAI Conference on Web and Social Media 20 (1):1676-99. https://doi.org/10.1609/icwsm.v20i1.42718.