Detection and Discovery of Misinformation Sources Using Attributed Webgraphs

Authors

  • Peter Carragher, Carnegie Mellon University
  • Evan M. Williams, Carnegie Mellon University
  • Kathleen M. Carley, Carnegie Mellon University

DOI:

https://doi.org/10.1609/icwsm.v18i1.31309

Abstract

Website reliability labels underpin almost all research in misinformation detection. However, misinformation sources often exhibit transient behavior, which makes many such labeled lists obsolete over time. We demonstrate that Search Engine Optimization (SEO) attributes provide strong signals for predicting news site reliability. We introduce a novel attributed webgraph dataset with labeled news domains and their connections to outlinking and backlinking domains. We demonstrate the success of graph neural networks in detecting news site reliability using these attributed webgraphs, and show that our baseline news site reliability classifier outperforms current state-of-the-art (SoTA) methods on the PoliticalNews dataset, achieving an F1 score of 0.96. Finally, we introduce and evaluate a novel graph-based algorithm for discovering previously unknown misinformation news sources.

Published

2024-05-28

How to Cite

Carragher, P., Williams, E. M., & Carley, K. M. (2024). Detection and Discovery of Misinformation Sources Using Attributed Webgraphs. Proceedings of the International AAAI Conference on Web and Social Media, 18(1), 214-226. https://doi.org/10.1609/icwsm.v18i1.31309