Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning

Michele Joshua Maggini; Dhia Merzougui; Rabiraj Bandyopadhyay; Gaël Dias; Fabrice Maurel; Pablo Gamallo

doi:10.1609/icwsm.v20i1.42712

Authors

Michele Joshua Maggini CiTIUS, University of Santiago de Compostela,
Dhia Merzougui University of Caen Normandie
Rabiraj Bandyopadhyay GESIS
Gaël Dias ENSICAEN, CNRS, Normandie Univ, GREYC UMR6072
Fabrice Maurel ENSICAEN, CNRS, Normandie Univ, GREYC UMR6072
Pablo Gamallo CiTIUS, University of Santiago de Compostela,

DOI:

https://doi.org/10.1609/icwsm.v20i1.42712

Abstract

This study provides a comprehensive benchmark of Large Language Model (LLM) adaptation paradigms, specifically Fine-Tuning (FT) vs. In-Context Learning (ICL), for the detection of hyperpartisan news, fake news, political bias, and harmful content. We evaluate encoder-only and decoder-only architectures across 10 datasets in five languages: English, Spanish, Portuguese, Arabic, and Bulgarian. Our analysis covers a wide spectrum of ICL strategies, including zero-shot prompts, rule-based codebooks, Chain-of-Thought (CoT), and few-shot selection using both random and diversity-optimized (Determinantal Point Process) exemplars. Experimental results reveal that FT consistently outperforms ICL; notably, smaller fine-tuned models often surpass the performance of larger models (e.g., Llama-3.1-8B, Mistral-Nemo, and Qwen2.5-7B) used in ICL settings. We further find that model architecture suitability is task-dependent: fine-tuned decoders excel at political bias and fake news detection, while encoders remain superior for hyperpartisan and harmful tweet classification. Among ICL methods, the codebook approach generally yields the highest accuracy, frequently outperforming CoT. Our findings underscore that despite the versatility of LLMs, task-specific fine-tuning remains the most effective strategy for identifying nuanced problematic content online.

Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information