Xiao, J., Yao, A., Liu, Z., Li, Y., Ji, W., & Chua, T.-S. (2022). Video as Conditional Graph Hierarchy for Multi-Granular Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 2804-2812. https://doi.org/10.1609/aaai.v36i3.20184