Li, X., J. Song, L. Gao, X. Liu, W. Huang, X. He, and C. Gan. “Beyond RNNs: Positional Self-Attention With Co-Attention for Video Question Answering”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 8658-65, doi:10.1609/aaai.v33i01.33018658.