Meng, G., He, S., Wang, J., Dai, T., Zhang, L., Zhu, J., … Jiang, Y. (2025). EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 39(6), 6126–6134. https://doi.org/10.1609/aaai.v39i6.32655