Li, Y., Liu, M., Li, Z., Bian, Y., Wang, X., Zhai, E., & Wang, Y. (2026). Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6726–6734. https://doi.org/10.1609/aaai.v40i8.37604