Bao, Peijun, Wenhan Yang, Boon Poh Ng, Meng Hwa Er, and Alex C. Kot. “Cross-Modal Label Contrastive Learning for Unsupervised Audio-Visual Event Localization”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 1 (June 26, 2023): 215-222. Accessed October 15, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/25093.