Jenni, S., Black, A., & Collomosse, J. (2023). Audio-Visual Contrastive Learning with Temporal Self-Supervision. Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), 7996-8004. https://doi.org/10.1609/aaai.v37i7.25967