Xiao, Junbin, et al. “Video as Conditional Graph Hierarchy for Multi-Granular Question Answering.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 3, June 2022, pp. 2804-12, doi:10.1609/aaai.v36i3.20184.