Zhang, Zhehao, Ryan A. Rossi, Tong Yu, Franck Dernoncourt, Ruiyi Zhang, Jiuxiang Gu, Sungchul Kim, Xiang Chen, Zichao Wang, and Nedim Lipka. 2026. “VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-Use”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (43):36536-46. https://doi.org/10.1609/aaai.v40i43.40976.