Sarkar, Pritam, and Ali Etemad. “Self-Supervised Audio-Visual Representation Learning With Relaxed Cross-Modal Synchronicity”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 8, June 2023, pp. 9723-32, doi:10.1609/aaai.v37i8.26162.