[1] Tang, C. et al. 2026. TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 11 (Mar. 2026), 9368–9376. DOI:https://doi.org/10.1609/aaai.v40i11.37896.