Zhou, H. (2026) “MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), pp. 13620–13628. doi: 10.1609/aaai.v40i16.38368.