(1)
Zhao, X.; Wang, Y.; Jin, P. Audio-Visual Adaptive Fusion Network for Question Answering Based on Contrastive Learning. AAAI 2025, 39, 10483-10491.