Zhang, Y., L.-M. Po, X. Xu, M. Liu, Y. Wang, W. Ou, Y. Zhao, and W.-Y. Yu. “Contrastive Spatio-Temporal Pretext Learning for Self-Supervised Video Representation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 3, June 2022, pp. 3380-9, doi:10.1609/aaai.v36i3.20248.