Xiao, Junbin, Angela Yao, Zhiyuan Liu, Yicong Li, Wei Ji, and Tat-Seng Chua. “Video As Conditional Graph Hierarchy for Multi-Granular Question Answering”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 3 (June 28, 2022): 2804-2812. Accessed March 28, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20184.