Zhao, Zhiqian, et al. “Heterogeneous Prompt-Guided Entity Inferring and Distilling for Scene-Text Aware Cross-Modal Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 39, no. 10, Apr. 2025, pp. 10537-45, doi:10.1609/aaai.v39i10.33144.