Shi, Y., Long, Q., Wu, Y., & Wang, W. (2026). Causality Matters: How Temporal Information Emerges in Video Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(11), 9006–9014. https://doi.org/10.1609/aaai.v40i11.37856