Zhang, Yujia, Lai-Man Po, Xuyuan Xu, Mengyang Liu, Yexin Wang, Weifeng Ou, Yuzhi Zhao, and Wing-Yin Yu. “Contrastive Spatio-Temporal Pretext Learning for Self-Supervised Video Representation”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 3 (June 28, 2022): 3380-3389. Accessed April 18, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20248.