Zhang, Y. (2026) “Improving Region Representation Learning from Urban Imagery with Noisy Long-Caption Supervision”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(19), pp. 16397–16405. doi: 10.1609/aaai.v40i19.38678.