1.
Fang X, Fang W, Wang C, Qu X, Liu D. Rethinking Video-Language Model from the Language Input Perspective. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 28];40(5):3885-93. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/37390