OX-MABSR: A Benchmark for Open-domain Explainable Multimodal Aspect-Based Sentiment Reasoning

Authors

  • Xinjing Liu Beijing University of Posts and Telecommunications
  • ZiXin Xue Beijing University of Posts and Telecommunications
  • Pengyue Lin Beijing University of Posts and Telecommunications
  • Xinyu Tu Beijing University of Posts and Telecommunications
  • Siwei Xu Beijing University of Posts and Telecommunications
  • Ruifan Li Beijing University of Posts and Telecommunications

DOI:

https://doi.org/10.1609/aaai.v40i38.40493

Abstract

Multimodal Aspect-Based Sentiment Analysis (MABSA) involves extracting aspect terms from text-image pairs and identifying their sentiments. Most existing tasks consider one fixed sentiment category with explicitly mentioned aspects. However, these tasks seldom consider expressive sentiment categories, implicit aspects, and explainability. To this end, we introduce a novel task of Open-domain Explainable Multimodal Aspect-Based Sentiment Reasoning (OX-MABSR). This task enables the prediction of open-vocabulary aspect-sentiment pairs, together with the generation of sentiment explanations and reasoning paths. To benchmark OX-MABSR task, we construct OX-MABSR-Bench, a dataset annotated with explicit and implicit aspects, expressive sentiment categories, as well as perceptual and cognitive two-level explanations. The explanations capture visual and textual cues, including aesthetics, facial expressions, scenes, and textual semantics, together with background and situational knowledge. In addition, we annotate the reasoning paths that trace how the sentiment evolves from surface cues to a deeper contextual understanding. To address OX-MABSR task, we propose MABSR-LLM. Extensive experimental results show our MABSR-LLM outperforms strong baselines. To the best of our knowledge, we are the first to provide a unified framework for open-domain and explainable MABSR.

Downloads

Published

2026-03-14

How to Cite

Liu, X., Xue, Z., Lin, P., Tu, X., Xu, S., & Li, R. (2026). OX-MABSR: A Benchmark for Open-domain Explainable Multimodal Aspect-Based Sentiment Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32204–32212. https://doi.org/10.1609/aaai.v40i38.40493

Issue

Section

AAAI Technical Track on Natural Language Processing III