SEVADE: Self-Evolving Multi-Agent Analysis with Decoupled Evaluation for Hallucination-Resistant Sarcasm Detection

Authors

  • Ziqi Liu Xi'an Jiaotong-Liverpool University
  • Ziyang Zhou Xi'an Jiaotong-Liverpool University
  • Yilin Li Xi'an Jiaotong-Liverpool University
  • Mingxuan Hu Xi'an Jiaotong-Liverpool University
  • Yushan Pan Xi'an Jiaotong-Liverpool University
  • Zhijie Xu Xi'an Jiaotong-Liverpool University
  • Yangbin Chen Xi'an Jiaotong-Liverpool University

DOI:

https://doi.org/10.1609/aaai.v40i35.40200

Abstract

Sarcasm detection is a crucial yet challenging Natural Language Processing task. Existing Large Language Model methods are often limited by single-perspective analysis, static reasoning pathways, and a susceptibility to hallucination when processing complex ironic rhetoric, which impacts their accuracy and reliability. To address these challenges, we propose SEVADE, a novel Self-Evolving multi-agent Analysis framework with Decoupled Evaluation for hallucination-resistant sarcasm detection. The core of our framework is a Dynamic Agentive Reasoning Engine (DARE), which utilizes a team of specialized agents grounded in linguistic theory to perform a multifaceted deconstruction of the text and generate a structured reasoning chain. Subsequently, a separate lightweight rationale adjudicator (RA) performs the final classification based solely on this reasoning chain. This decoupled architecture is designed to mitigate the risk of hallucination by separating complex reasoning from the final judgment. Extensive experiments on four benchmark datasets demonstrate that our framework achieves state-of-the-art performance, with average improvements of 7.01% in Accuracy and 6.55% in Macro-F1 score.

Downloads

Published

2026-03-14

How to Cite

Liu, Z., Zhou, Z., Li, Y., Hu, M., Pan, Y., Xu, Z., & Chen, Y. (2026). SEVADE: Self-Evolving Multi-Agent Analysis with Decoupled Evaluation for Hallucination-Resistant Sarcasm Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 40(35), 29573-29581. https://doi.org/10.1609/aaai.v40i35.40200

Issue

Section

AAAI Technical Track on Multiagent Systems