Hu, Xiantao, Ying Tai, Xu Zhao, Chen Zhao, Zhenyu Zhang, Jun Li, Bineng Zhong, and Jian Yang. 2025. “Exploiting Multimodal Spatial-Temporal Patterns for Video Object Tracking.” Proceedings of the AAAI Conference on Artificial Intelligence 39 (4): 3581–89. https://doi.org/10.1609/aaai.v39i4.32372.