Zhu, Z., Zhang, F., Zhang, Y., Sun, J., Hu, G., Wu, H., … Wu, X. (2026). S³-MSD: Large Vision-Language Model for Explainable and Generalizable Multi-modal Sarcasm Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 40(41), 35266–35274. https://doi.org/10.1609/aaai.v40i41.40834