[1]
Jiang, H., Jin, Y., Sun, Z., Xu, K., Xu, K., Chen, L., Song, Y., Gai, K. and Mu, Y. 2025. Granularity-Adaptive Spatial Evidence Tokenization for Video Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence. 39, 4 (Apr. 2025), 3976-3984. DOI:https://doi.org/10.1609/aaai.v39i4.32416.