Fang, Xiang, Wanlong Fang, Changshuo Wang, Xiaoye Qu, and Daizong Liu. 2026. “Rethinking Video-Language Model from the Language Input Perspective.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (5): 3885–93. https://doi.org/10.1609/aaai.v40i5.37390.