Li, Y. (2026) “Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), pp. 6726–6734. doi: 10.1609/aaai.v40i8.37604.