Zhang, L., Fu, H., Hao, X., Zhang, S., Zhang, Q., Liu, R., … Ding, W. (2026). What You See Is What You Reach: Towards Spatial Navigation with High-Level Human Instructions. Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), 12627–12635. https://doi.org/10.1609/aaai.v40i15.38258