(1)
Tian, Y.; Ma, T.; Xie, L.; Ye, Q. ChatterBox: Multimodal Referring and Grounding With Chain-of-Questions. AAAI 2025, 39, 7401-7409.