The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse

Authors

  • Valentin Hofmann, University of Oxford; LMU Munich
  • Hinrich Schütze, LMU Munich
  • Janet B. Pierrehumbert, University of Oxford

Keywords:

Qualitative and quantitative studies of social media, Social network analysis; communities identification; expertise and authority discovery, Subjectivity in textual data; sentiment analysis; polarity/opinion identification and extraction, linguistic analyses of social media behavior

Abstract

We introduce the Reddit Politosphere, a large-scale resource of online political discourse covering more than 600 political discussion groups over a period of 12 years. To the best of our knowledge, it is the largest and most ideologically comprehensive dataset of its type currently available. One key feature of the Reddit Politosphere is that it consists of both text and network data, allowing for methodologically diverse analyses. We describe in detail how we create the Reddit Politosphere, present descriptive statistics, and sketch potential directions for future research based on the resource.

Published

2022-05-31

How to Cite

Hofmann, V., Schütze, H., & Pierrehumbert, J. B. (2022). The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse. Proceedings of the International AAAI Conference on Web and Social Media, 16(1), 1259-1267. Retrieved from https://ojs.aaai.org/index.php/ICWSM/article/view/19377