[1]
X. Ding, J. Gao, C. Pan, W. Wang, and J. Qin, “History-Enhanced Two-Stage Transformer for Aerial Vision-and-Language Navigation”, AAAI, vol. 40, no. 22, pp. 18225–18233, Mar. 2026.