Liu, X., Paul, S., Chatterjee, M., & Cherian, A. (2024). CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 3765–3773. https://doi.org/10.1609/aaai.v38i4.28167