[1] A. Cherian, C. Hori, T. K. Marks, and J. Le Roux, “(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering”, AAAI, vol. 36, no. 1, pp. 444–453, Jun. 2022.