[1]
Y. Chen, H. Zhu, J. Wang, K. Chen, and X. Qian, “AV-SSAN: Audio-Visual Selective DOA Estimation Through Explicit Multi-Band Semantic-Spatial Alignment”, AAAI, vol. 40, no. 25, pp. 20409–20417, Mar. 2026.