Zhou, D., Zhou, X., Hu, D., Zhou, H., Bai, L., Liu, Z., & Ouyang, W. (2022). SepFusion: Finding Optimal Fusion Structures for Visual Sound Separation. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3544-3552. https://doi.org/10.1609/aaai.v36i3.20266