LI, Dezhuang; LI, Ruoqi; WANG, Lijun; WANG, Yifan; QI, Jinqing; ZHANG, Lu; LIU, Ting; XU, Qingquan; LU, Huchuan. You Only Infer Once: Cross-Modal Meta-Transfer for Referring Video Object Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 36, n. 2, p. 1297–1305, 2022. DOI: 10.1609/aaai.v36i2.20017. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/20017. Accessed: 15 May 2026.