Shi, Z., Zhang, L., Li, L., & Shen, Y. (2025). Towards Audio-Visual Navigation in Noisy Environments: A Large-Scale Benchmark Dataset and an Architecture Considering Multiple Sound-Sources. Proceedings of the AAAI Conference on Artificial Intelligence, 39(14), 14673–14680. https://doi.org/10.1609/aaai.v39i14.33608