[1] Chen, Y. et al. 2026. AV-SSAN: Audio-Visual Selective DOA Estimation Through Explicit Multi-Band Semantic-Spatial Alignment. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 25 (Mar. 2026), 20409–20417. DOI:https://doi.org/10.1609/aaai.v40i25.39175.