Shi, Yumeng, et al. “Causality Matters: How Temporal Information Emerges in Video Language Models”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 11, Mar. 2026, pp. 9006-14, doi:10.1609/aaai.v40i11.37856.