Bridging the Gap for Test-Time Multimodal Sentiment Analysis

Authors

  • Zirun Guo, Zhejiang University
  • Tao Jin, Zhejiang University
  • Wenlong Xu, Zhejiang University
  • Wang Lin, Zhejiang University
  • Yangyang Wu, Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v39i16.33867

Abstract

Multimodal sentiment analysis (MSA) is an emerging research topic that aims to understand and recognize human sentiment or emotions through multiple modalities. However, in real-world dynamic scenarios, the distribution of the target data changes continually and differs from that of the source data used to train the model, which leads to performance degradation. Common adaptation methods usually require access to the source data, which can raise privacy concerns or incur storage overhead. Therefore, test-time adaptation (TTA) methods have been introduced to improve the performance of the model at inference time. Existing TTA methods are always based on probabilistic models and unimodal learning, and thus cannot be applied to MSA, which is often considered a multimodal regression task. In this paper, we propose two strategies, Contrastive Adaptation and Stable Pseudo-label generation (CASP), for test-time adaptation in multimodal sentiment analysis. The two strategies address distribution shifts in MSA by enforcing consistency and minimizing empirical risk, respectively. Extensive experiments show that CASP brings significant and consistent improvements to model performance across various distribution shift settings and with different backbones, demonstrating its effectiveness and versatility.

Published

2025-04-11

How to Cite

Guo, Z., Jin, T., Xu, W., Lin, W., & Wu, Y. (2025). Bridging the Gap for Test-Time Multimodal Sentiment Analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 39(16), 16987–16995. https://doi.org/10.1609/aaai.v39i16.33867

Section

AAAI Technical Track on Machine Learning II