YUAN, Xu; ZHOU, Li; SUN, Zenghui; ZHOU, Zikun; LAN, Jinsong. Instruction-guided Multi-Granularity Segmentation and Captioning with Large Multimodal Model. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 39, n. 9, p. 9725–9733, 2025. DOI: 10.1609/aaai.v39i9.33054. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/33054. Acesso em: 13 may. 2026.