Gao, L., Zeng, P., Song, J., Li, Y.-F., Liu, W., Mei, T. and Shen, H. T. (2019) “Structured Two-Stream Attention Network for Video Question Answering”, Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), pp. 6391-6398. doi: 10.1609/aaai.v33i01.33016391.