Bhosale, Swapnil, Haosen Yang, Diptesh Kanojia, Jiankang Deng, and Xiatian Zhu. 2025. “Unsupervised Audio-Visual Segmentation With Modality Alignment”. Proceedings of the AAAI Conference on Artificial Intelligence 39 (15):15567-75. https://doi.org/10.1609/aaai.v39i15.33709.