LI, Yuzhen; LIU, Min; LI, Zhaoyang; BIAN, Yuan; WANG, Xueping; ZHAI, Erbo; WANG, Yaonan. Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 8, p. 6726–6734, 2026. DOI: 10.1609/aaai.v40i8.37604. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/37604. Acesso em: 15 may. 2026.