(1)
Meng, G.; He, S.; Wang, J.; Dai, T.; Zhang, L.; Zhu, J.; Li, Q.; Wang, G.; Zhang, R.; Jiang, Y. EvdCLIP: Improving Vision-Language Retrieval With Entity Visual Descriptions from Large Language Models. AAAI 2025, 39, 6126-6134.