[1]
J. Xiao, A. Yao, Z. Liu, Y. Li, W. Ji, and T.-S. Chua, “Video as Conditional Graph Hierarchy for Multi-Granular Question Answering”, AAAI, vol. 36, no. 3, pp. 2804-2812, Jun. 2022.