(1)
Gao, L.; Zeng, P.; Song, J.; Li, Y.-F.; Liu, W.; Mei, T.; Shen, H. T. Structured Two-Stream Attention Network for Video Question Answering. AAAI 2019, 33, 6391-6398.