(1)
Guo, X.; Yang, X.; Zhang, L.; Yang, J.; Wang, Z.; Luan, J. AV-Edit: Multimodal Generative Sound Effect Editing via Audio-Visual Semantic Joint Control. AAAI 2026, 40, 21504-21512.