[1] J. Gao, Q. Qiao, T. Wu, Z. Wang, Z. Cao, and W. Li, “AIM: Let Any Multimodal Large Language Models Embrace Efficient In-Context Learning”, AAAI, vol. 39, no. 3, pp. 3077–3085, Apr. 2025.