(1)
Wang, Y.; Liu, W.; Li, G.; Ding, J.; Hu, D.; Li, X. Prompting Segmentation With Sound Is Generalizable Audio-Visual Source Localizer. AAAI 2024, 38, 5669-5677.