Wang, Yaoting, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, and Xi Li. 2024. “Prompting Segmentation With Sound Is Generalizable Audio-Visual Source Localizer”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (6):5669-77. https://doi.org/10.1609/aaai.v38i6.28378.