[1]
K. H. I. Arif, J. Yoon, D. S. Nikolopoulos, H. Vandierendonck, D. John, and B. Ji, “HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models”, AAAI, vol. 39, no. 2, pp. 1773–1781, Apr. 2025.