Zhang, Yimei, Guojiang Shen, Kaili Ning, Tongwei Ren, Xuebo Qiu, Mengmeng Wang, and Xiangjie Kong. “Improving Region Representation Learning from Urban Imagery With Noisy Long-Caption Supervision”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 19 (March 14, 2026): 16397–16405. Accessed May 16, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/38678.