[1]
L. Zhang, “What You See Is What You Reach: Towards Spatial Navigation with High-Level Human Instructions”, AAAI, vol. 40, no. 15, pp. 12627–12635, Mar. 2026.