Li, J., Padmakumar, A., Sukhatme, G., & Bansal, M. (2024). VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18517-18526. https://doi.org/10.1609/aaai.v38i17.29813