Gao, L., P. Zeng, J. Song, Y.-F. Li, W. Liu, T. Mei, and H. T. Shen. “Structured Two-Stream Attention Network for Video Question Answering”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 6391-8, doi:10.1609/aaai.v33i01.33016391.