LI, S.; LI, L.; OUYANG, K.; REN, S.; LIU, Y.; ZHANG, Y.; ZHANG, F.; KONG, L.; LIU, Q.; SUN, X. TEMPLE: Incentivizing Temporal Understanding of Video Large Language Models via Progressive Pre-SFT Alignment. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 8, p. 6378-6386, 2026. DOI: 10.1609/aaai.v40i8.37565. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/37565. Acesso em: 4 may. 2026.