(1) Jiang, H.; Jin, Y.; Sun, Z.; Xu, K.; Xu, K.; Chen, L.; Song, Y.; Gai, K.; Mu, Y. Granularity-Adaptive Spatial Evidence Tokenization for Video Question Answering. AAAI 2025, 39, 3976-3984.