Li, D., R. Li, L. Wang, Y. Wang, J. Qi, L. Zhang, T. Liu, Q. Xu, and H. Lu. “You Only Infer Once: Cross-Modal Meta-Transfer for Referring Video Object Segmentation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, no. 2, June 2022, pp. 1297-05, doi:10.1609/aaai.v36i2.20017.