Guo, P., Huang, H., He, P., Liu, X., Xiao, T., & Zhang, W. (2025). OpenVIS: Open-vocabulary Video Instance Segmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(3), 3275–3283. https://doi.org/10.1609/aaai.v39i3.32338