Xu, M. (2026) “VP-Bench: A Comprehensive Benchmark for Visual Prompting in Multimodal Large Language Models”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(13), pp. 11332–11341. doi: 10.1609/aaai.v40i13.38114.