[1] H. Wu, K.-T. Cheng, S. Lin, and Z. Wu, “A Study of Finetuning Video Transformers for Multi-view Geometry Tasks”, AAAI, vol. 40, no. 13, pp. 10646–10654, Mar. 2026.