Zhang, Lingfeng, Haoxiang Fu, Xiaoshuai Hao, Shuyi Zhang, Qiang Zhang, Rui Liu, Long Chen, and Wenbo Ding. “What You See Is What You Reach: Towards Spatial Navigation With High-Level Human Instructions”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 15 (March 14, 2026): 12627–12635. Accessed May 13, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/38258.