[1] Nemani, H. and Garimella, K. 2026. Large-Scale Multimodal Content Analysis and Annotation with Vision-Language Models. Proceedings of the International AAAI Conference on Web and Social Media. 20, 1 (May 2026), 1676–1699. DOI:https://doi.org/10.1609/icwsm.v20i1.42718.