Gao, Lianli, Pengpeng Zeng, Jingkuan Song, Yuan-Fang Li, Wu Liu, Tao Mei, and Heng Tao Shen. “Structured Two-Stream Attention Network for Video Question Answering”. Proceedings of the AAAI Conference on Artificial Intelligence 33, no. 01 (July 17, 2019): 6391-6398. Accessed November 29, 2022. https://ojs.aaai.org/index.php/AAAI/article/view/4602.