Wang, Xinyi, Na Zhao, Zhiyuan Han, Dan Guo, and Xun Yang. “AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-Based Referring”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 8 (April 11, 2025): 8006-8014. Accessed April 25, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32863.