IsamasRed: A Public Dataset Tracking Reddit Discussions on Israel-Hamas Conflict

Authors

  • Kai Chen: Information Sciences Institute, University of Southern California; Department of Computer Science, University of Southern California
  • Zihao He: Information Sciences Institute, University of Southern California; Department of Computer Science, University of Southern California
  • Keith Burghardt: Information Sciences Institute, University of Southern California
  • Jingxin Zhang: Department of Computer Science, University of Southern California
  • Kristina Lerman: Information Sciences Institute, University of Southern California; Department of Computer Science, University of Southern California

DOI:

https://doi.org/10.1609/icwsm.v18i1.31434

Abstract

The conflict between Israel and Palestinians significantly escalated after the October 7, 2023 Hamas attack, capturing global attention. To understand the public discourse on this conflict, we present a meticulously compiled dataset, IsamasRed, comprising nearly 400,000 conversations and over 8 million comments from Reddit, spanning August 2023 to November 2023. We introduce an innovative keyword extraction framework that leverages a large language model to identify pertinent keywords, ensuring comprehensive data collection. Our initial analysis of the dataset, which examines topics, controversy, and trends in emotional and moral language over time, highlights the emotionally charged and complex nature of the discourse. This dataset aims to enrich the understanding of online discussions, shedding light on the complex interplay between ideology, sentiment, and community engagement in digital spaces.

Published

2024-05-28

How to Cite

Chen, K., He, Z., Burghardt, K., Zhang, J., & Lerman, K. (2024). IsamasRed: A Public Dataset Tracking Reddit Discussions on Israel-Hamas Conflict. Proceedings of the International AAAI Conference on Web and Social Media, 18(1), 1900-1912. https://doi.org/10.1609/icwsm.v18i1.31434