[1] Gao, S. et al. 2024. AVSegFormer: Audio-Visual Segmentation with Transformer. Proceedings of the AAAI Conference on Artificial Intelligence. 38, 11 (Mar. 2024), 12155–12163. DOI:https://doi.org/10.1609/aaai.v38i11.29104.