Group-aware Multiscale Ensemble Learning for Test-Time Multimodal Sentiment Analysis

Authors

  • Kai Tang Zhejiang University
  • Yixuan Tang National University of Singapore
  • Tianyi Chen Zhejiang University
  • Haokai Xu Zhejiang University
  • Qiqi Luo Ant Group
  • Jin Guang Zheng Ant Group
  • Zhixin Zhang Ant Group
  • Gang Chen Zhejiang University
  • Haobo Wang Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v40i30.39782

Abstract

Multi-modal Sentiment Analysis (MSA) enables machines to perceive human sentiments by integrating multiple modalities such as text, video, and audio. Despite recent progress, most existing methods assume distribution consistency between training and test data—a condition rarely met in real-world scenarios. To address domain shifts without relying on source data or target labels, Test-Time Adaptation (TTA) has emerged as a promising paradigm. However, applying TTA methods to MSA faces two challenges: a representation bottleneck inherent to the regression formulation and the inconsistency in modality fusion caused by modality-specific data augmentation techniques. To overcome these issues, we propose Group-aware Multiscale Ensemble Learning (GMEL), which leverages a von Mises-Fisher (vMF) mixture distribution to model latent sentiment groups and integrates a multi-scale re-dropout strategy for modality-agnostic feature augmentation, preserving fusion consistency. Extensive experiments on three benchmark datasets using two backbone architectures show that GMEL significantly outperforms existing baselines, demonstrating strong robustness to test-time distribution shifts in multi-modal sentiment analysis.

Downloads

Published

2026-03-14

How to Cite

Tang, K., Tang, Y., Chen, T., Xu, H., Luo, Q., Zheng, J. G., … Wang, H. (2026). Group-aware Multiscale Ensemble Learning for Test-Time Multimodal Sentiment Analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 40(30), 25831–25839. https://doi.org/10.1609/aaai.v40i30.39782

Issue

Section

AAAI Technical Track on Machine Learning VII