1.
Zhou H, Guo X, Zhu Y, Kong AW-K. MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 15];40(16):13620-8. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/38368