[1]
Z. Shi, L. Zhang, L. Li, and Y. Shen, “Towards Audio-Visual Navigation in Noisy Environments: A Large-Scale Benchmark Dataset and an Architecture Considering Multiple Sound-Sources”, AAAI, vol. 39, no. 14, pp. 14673–14680, Apr. 2025.