Fusing Time-Domain and Constellation Views: A Multimodal MAE for Wireless Signals (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v40i48.42187Abstract
This paper introduces a multi-modal masked autoencoder (MMAE) that jointly denoises and classifies signals by fusing time-domain IQ sequences and constellation diagrams within a cross-attentive transformer. This approach treats noise as a learnable modality to enhance robustness, a dynamic masking curriculum combined with domain regularization training and a hybrid loss function to promote domain-invariant features. Experimentation on the RadioML 2018.01A and RadioML22 datasets demonstrates superior accuracy across different SNR levels while using substantially less labeled data than state-of-the-art approaches.Published
2026-03-14
How to Cite
Banerjee, A., & Sen, A. (2026). Fusing Time-Domain and Constellation Views: A Multimodal MAE for Wireless Signals (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41129–41131. https://doi.org/10.1609/aaai.v40i48.42187
Issue
Section
AAAI Student Abstract and Poster Program