Zhao, Z., Li, L., Shen, L., Sheng, X., Sun, Y., Kang, F., & Yan, C. (2026). Temporal Calibrating and Distilling for Scene-Text Aware Text-Video Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 40(16), 13323–13331. https://doi.org/10.1609/aaai.v40i16.38335