Li, Zongyi, Li Jianbo, Yuxuan Shi, Jiazhong Chen, Shijuan Huang, Linnan Tu, Fei Shen, and Hefei Ling. “Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 5 (April 11, 2025): 5119-5127. Accessed May 4, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/32543.