(1)
Fang, X.; Fang, W.; Wang, C.; Qu, X.; Liu, D. Rethinking Video-Language Model from the Language Input Perspective. AAAI 2026, 40, 3885-3893.