Cherian, A., Hori, C., Marks, T. K., & Le Roux, J. (2022). (2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence, 36(1), 444-453. https://doi.org/10.1609/aaai.v36i1.19922