Zhou, Hao, et al. “MACS: Multi-Source Audio-to-Image Generation With Contextual Significance and Semantic Alignment”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 16, Mar. 2026, pp. 13620-8, doi:10.1609/aaai.v40i16.38368.