1. Tian Y, Ma T, Xie L, Ye Q. ChatterBox: Multimodal Referring and Grounding with Chain-of-Questions. AAAI [Internet]. 2025 Apr. 11 [cited 2026 May 10];39(7):7401-9. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/32796