[1] Y. Shi, Q. Long, Y. Wu, and W. Wang, “Causality Matters: How Temporal Information Emerges in Video Language Models”, AAAI, vol. 40, no. 11, pp. 9006–9014, Mar. 2026.