(1) Zhang, Y.; Shen, G.; Ning, K.; Ren, T.; Qiu, X.; Wang, M.; Kong, X. Improving Region Representation Learning from Urban Imagery With Noisy Long-Caption Supervision. AAAI 2026, 40, 16397-16405.