[1] Y. Hu, “LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition”, AAAI, vol. 38, no. 3, pp. 2274–2284, Mar. 2024.