1. Nemani H, Garimella K. Large-Scale Multimodal Content Analysis and Annotation with Vision-Language Models. ICWSM [Internet]. 2026 May 25 [cited 2026 May 27];20(1):1676-99. Available from: https://ojs.aaai.org/index.php/ICWSM/article/view/42718