Shi, Zhanbo, Lin Zhang, Linfei Li, and Ying Shen. “Towards Audio-Visual Navigation in Noisy Environments: A Large-Scale Benchmark Dataset and an Architecture Considering Multiple Sound-Sources.” Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 14 (April 11, 2025): 14673–14680. Accessed May 7, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/33608.