Zhang, Y., Po, L.-M., Xu, X., Liu, M., Wang, Y., Ou, W., Zhao, Y., & Yu, W.-Y. (2022). Contrastive Spatio-Temporal Pretext Learning for Self-Supervised Video Representation. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3380-3389. https://doi.org/10.1609/aaai.v36i3.20248