Chen, Y., Zhu, H., Wang, J., Chen, K., & Qian, X. (2026). AV-SSAN: Audio-Visual Selective DOA Estimation Through Explicit Multi-Band Semantic-Spatial Alignment. Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), 20409–20417. https://doi.org/10.1609/aaai.v40i25.39175