The Gray Area: Characterizing Moderator Disagreement on Reddit

Authors

  • Shayan Alipour Sapienza University of Rome Toronto Metropolitan University
  • Shruti Phadke Drexel University
  • Seyed Shahabeddin Mousavi Stanford University
  • Amirhossein Afsharrad Stanford University
  • Morteza Zihayat Toronto Metropolitan University
  • Mattia Samory Sapienza University of Rome

DOI:

https://doi.org/10.1609/icwsm.v20i1.42625

Abstract

Volunteer moderators play a crucial role in sustaining online dialogue, but they often disagree about what should or should not be allowed. In this paper, we study the complexity of content moderation with a focus on disagreements between moderators, which we term the “gray area” of moderation. Leveraging 5 years and 4.3 million moderation log entries from 24 subreddits of different topics and sizes, we characterize how gray area, or disputed cases, differ from undisputed cases. We show that one-in-seven moderation cases are disputed among moderators, often addressing transgressions where users' intent is not directly legible, such as in trolling and brigading, as well as tensions around community governance. This is concerning, as almost half of all gray area cases involved automated moderation decisions. Through extensive empirical analyses, we show that even state-of-the-art language models struggle to adjudicate gray area cases. Focusing on improving these models is unpromising. Through information-theoretic evaluations, we demonstrate that gray area cases are inherently harder to adjudicate than undisputed cases. We highlight the key role of expert human moderators in overseeing the moderation process and provide insights about the challenges of current moderation processes and tools.

Downloads

Published

2026-05-25

How to Cite

Alipour, S., Phadke, S., Mousavi, S. S., Afsharrad, A., Zihayat, M., & Samory, M. (2026). The Gray Area: Characterizing Moderator Disagreement on Reddit. Proceedings of the International AAAI Conference on Web and Social Media, 20(1), 58–75. https://doi.org/10.1609/icwsm.v20i1.42625