Zhou, J., Zhou, Z., Zhou, Y., Mao, Y., Duan, Z., & Guo, D. (2026). CLASP: Cross-modal Salient Anchor-based Semantic Propagation for Weakly-supervised Dense Audio-Visual Event Localization. Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), 13674–13682. https://doi.org/10.1609/aaai.v40i16.38374