1. Gao S, Chen Z, Chen G, Wang W, Lu T. AVSegFormer: Audio-Visual Segmentation with Transformer. AAAI [Internet]. 2024 Mar 24 [cited 2024 Sep 27];38(11):12155-63. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/29104