[1]
K. Ishihara, K. Sasaki, T. Takahashi, D. Shiono, and Y. Yamaguchi, “STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes”, AAAI, vol. 40, no. 7, pp. 5257–5266, Mar. 2026.