TMDC: A Two-Stage Modality Denoising and Complementation Framework for Multimodal Sentiment Analysis with Missing and Noisy Modalities

Authors

  • Yan Zhuang University of Electronic Science and Technology of China
  • Minhao Liu University of Electronic Science and Technology of China Shenzhen Institute for Advanced Study, UESTC
  • Yanru Zhang University of Electronic Science and Technology of China Shenzhen Institute for Advanced Study, UESTC
  • Jiawen Deng University of Electronic Science and Technology of China
  • Fuji Ren University of Electronic Science and Technology of China Shenzhen Institute for Advanced Study, UESTC

DOI:

https://doi.org/10.1609/aaai.v40i3.37212

Abstract

Multimodal Sentiment Analysis (MSA) aims to infer human sentiment by integrating information from multiple modalities such as text, audio, and video. In real-world scenarios, however, the presence of missing modalities and noisy signals significantly hinders the robustness and accuracy of existing models. While prior works have made progress on these issues, they are typically addressed in isolation, limiting overall effectiveness in practical settings. To jointly mitigate the challenges posed by missing and noisy modalities, we propose a framework called Two-stage Modality Denoising and Complementation (TMDC). TMDC comprises two sequential training stages. In the Intra-Modality Denoising Stage, denoised modality-specific and modality-shared representations are extracted from complete data using dedicated denoising modules, reducing the impact of noise and enhancing representational robustness. In the Inter-Modality Complementation Stage, these representations are leveraged to compensate for missing modalities, thereby enriching the available information and further improving robustness. Extensive evaluations on MOSI, MOSEI, and IEMOCAP demonstrate that TMDC consistently achieves superior performance compared to existing methods, establishing new state-of-the-art results.

Downloads

Published

2026-03-14

How to Cite

Zhuang, Y., Liu, M., Zhang, Y., Deng, J., & Ren, F. (2026). TMDC: A Two-Stage Modality Denoising and Complementation Framework for Multimodal Sentiment Analysis with Missing and Noisy Modalities. Proceedings of the AAAI Conference on Artificial Intelligence, 40(3), 2281–2289. https://doi.org/10.1609/aaai.v40i3.37212

Issue

Section

AAAI Technical Track on Cognitive Modeling & Cognitive Systems