Bao, P., Yang, W., Ng, B. P., Er, M. H., & Kot, A. C. (2023). Cross-Modal Label Contrastive Learning for Unsupervised Audio-Visual Event Localization. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 215-222. https://doi.org/10.1609/aaai.v37i1.25093