Labeled Datasets for Research on Information Operations
DOI:
https://doi.org/10.1609/icwsm.v19i1.35958Abstract
Social media platforms have become a hub for political activities and discussions, democratizing participation in these endeavors. However, they have also become an incubator for manipulation campaigns, like information operations (IOs). Some social media platforms have released datasets related to such IOs originating from different countries. However, we lack comprehensive control data that can enable the development of IO detection methods. To bridge this gap, we present new labeled datasets about 26 campaigns, which contain both IO posts verified by a social media platform and over 13M posts by 303k accounts that discussed similar topics in the same time frames (control data). The datasets will facilitate the study of narratives, network interactions, and engagement strategies employed by coordinated accounts across various campaigns and countries. By comparing these coordinated accounts against organic ones, researchers can develop and benchmark IO detection algorithms.Downloads
Published
2025-06-07
How to Cite
Seckin, O. C., Pote, M., Nwala, A. C., Yin, L., Luceri, L., Flammini, A., & Menczer, F. (2025). Labeled Datasets for Research on Information Operations. Proceedings of the International AAAI Conference on Web and Social Media, 19(1), 2567–2574. https://doi.org/10.1609/icwsm.v19i1.35958
Issue
Section
Dataset Papers