Kim, S., S. Jeong, E. Kim, I. Kang, and N. Kwak. “Self-Supervised Pre-Training and Contrastive Representation Learning for Multiple-Choice Video QA”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 14, May 2021, pp. 13171-9, https://ojs.aaai.org/index.php/AAAI/article/view/17556.