(1)
Zhou, Z.; Zhou, J.; Qian, W.; Tang, S.; Chang, X.; Guo, D. Dense Audio-Visual Event Localization Under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration. AAAI 2025, 39, 10905-10913.