Zhu, Zhihong, Fan Zhang, Yunyan Zhang, Jinghan Sun, Guimin Hu, Hao Wu, Yuyan Chen, Bowen Xing, and Xian Wu. “S³-MSD: Large Vision-Language Model for Explainable and Generalizable Multi-Modal Sarcasm Detection”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 41 (March 14, 2026): 35266–35274. Accessed May 15, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/40834.