Fusing Time-Domain and Constellation Views: A Multimodal MAE for Wireless Signals (Student Abstract)

Authors

  • Agniva Banerjee Indian Institute of Science Education and Research Bhopal
  • Arijit Sen Indian Institute of Science Education and Research Bhopal

DOI:

https://doi.org/10.1609/aaai.v40i48.42187

Abstract

This paper introduces a multi-modal masked autoencoder (MMAE) that jointly denoises and classifies signals by fusing time-domain IQ sequences and constellation diagrams within a cross-attentive transformer. This approach treats noise as a learnable modality to enhance robustness, a dynamic masking curriculum combined with domain regularization training and a hybrid loss function to promote domain-invariant features. Experimentation on the RadioML 2018.01A and RadioML22 datasets demonstrates superior accuracy across different SNR levels while using substantially less labeled data than state-of-the-art approaches.

Downloads

Published

2026-03-14

How to Cite

Banerjee, A., & Sen, A. (2026). Fusing Time-Domain and Constellation Views: A Multimodal MAE for Wireless Signals (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41129–41131. https://doi.org/10.1609/aaai.v40i48.42187