A Multimodal Prompt-based Framework for Analyzing Code-Mixed and Low-Resource Memes

Authors

  • Surendrabikram Thapa Virginia Tech
  • Hariram Veeramani University of California Los Angeles
  • Liang Hu DeepBlue Academy of Sciences Tongji University
  • Qi Zhang DeepBlue Academy of Sciences Tongji University
  • Wei Wang Shenzhen MSU-BIT University
  • Usman Naseem Macquarie University

DOI:

https://doi.org/10.1609/icwsm.v19i1.35909

Abstract

The emergence of social media has led memes to become a powerful mode of communication, blending text, images, and emojis. However, this surge in meme usage has also seen a rise in offensive material. With manual content moderation proving impractical due to the sheer volume of data, there's a pressing need for automated methods to identify harmful memes. Yet, existing research predominantly targets high-resource languages such as English, neglecting low-resource ones like Nepali. To bridge this gap, we introduce the first Nepali meme dataset annotated for hate speech and sentiment. Our contributions are threefold: (1) We create and release NeMeme, a unique dataset featuring Nepali and code-mixed Nepali memes (combining Nepali and English). (2) We evaluate NeMeme using cutting-edge unimodal and multimodal models to establish initial performance benchmarks. (3) We introduce MemeNePAL, a novel multimodal framework employing prompt-assisted learning to effectively categorize Nepali memes. MemeNePAL overcomes the shortcomings of prior state-of-the-art (SOTA) techniques, which were designed for high-resource languages and struggle with Nepali's linguistic differences and cultural subtleties. This work not only promotes inclusivity in content moderation research but also aligns with UN Sustainable Development Goals such as promoting well-being, reducing inequalities, and fostering peace. We adhere to FAIR principles by making the dataset publicly available.

Downloads

Published

2025-06-07

How to Cite

Thapa, S., Veeramani, H., Hu, L., Zhang, Q., Wang, W., & Naseem, U. (2025). A Multimodal Prompt-based Framework for Analyzing Code-Mixed and Low-Resource Memes. Proceedings of the International AAAI Conference on Web and Social Media, 19(1), 1913-1923. https://doi.org/10.1609/icwsm.v19i1.35909