Ma, Yinchao, Yuyang Tang, Wenfei Yang, Tianzhu Zhang, Jinpeng Zhang, and Mengxue Kang. 2024. “Unifying Visual and Vision-Language Tracking via Contrastive Learning.” Proceedings of the AAAI Conference on Artificial Intelligence 38 (5): 4107–16. https://doi.org/10.1609/aaai.v38i5.28205.