Zhao, Z., Li, L., Zhang, J., Sun, Y., Sheng, X., Yin, H., & Jiang, S. (2025). Heterogeneous Prompt-Guided Entity Inferring and Distilling for Scene-Text Aware Cross-Modal Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 39(10), 10537–10545. https://doi.org/10.1609/aaai.v39i10.33144