Cheng, Zesen, Kehan Li, Li Hao, Peng Jin, Xiawu Zheng, Chang Liu, and Jie Chen. “Aligning Instance Brownian Bridge With Texts for Open-Vocabulary Video Instance Segmentation”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 3 (April 11, 2025): 2482–2490. Accessed May 10, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32250.