1.
Meng G, He S, Wang J, Dai T, Zhang L, Zhu J, et al. EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models. AAAI [Internet]. 2025 Apr. 11 [cited 2026 May 28];39(6):6126-34. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/32655