1. Huang H, Cen M, Tan K, Quan X, Huang G, Zhang H. GraphCoT-VLA: A 3D Spatial-Aware Reasoning Vision-Language-Action Model for Robotic Manipulation with Ambiguous Instructions. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 17];40(22):18324-32. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/38896