DynamicEarth: How Far Are We from Open-Vocabulary Change Detection?

Authors

  • Kaiyu Li School of Software Engineering, Xi’an Jiaotong University
  • Xiangyong Cao School of Computer Science and Technology, Xi’an Jiaotong University Ministry of Education Key Laboratory of Intelligent Networks and Network Security
  • Yupeng Deng Aerospace Information Research Institute, Chinese Academy of Sciences
  • Chao Pang School of Computer Science, Wuhan University
  • Zepeng Xin College of Artificial Intelligence, Xi’an Jiaotong University
  • Hui Qiao China Telecom Shaanxi Branch
  • Tieliang Gong School of Computer Science and Technology, Xi’an Jiaotong University
  • Deyu Meng School of Mathematics and Statistics, Xi’an Jiaotong University Ministry of Education Key Laboratory of Intelligent Networks and Network Security
  • Zhi Wang School of Software Engineering, Xi’an Jiaotong University

DOI:

https://doi.org/10.1609/aaai.v40i8.37554

Abstract

Monitoring Earth's evolving land covers requires methods capable of detecting changes across a wide range of categories and contexts. Existing change detection methods are hindered by their dependency on predefined classes, reducing their effectiveness in open-world applications. To address this issue, we introduce open-vocabulary change detection (OVCD), a novel task that bridges vision and language to detect changes across any category. Considering the lack of high-quality data and annotation, we propose two training-free frameworks, M-C-I and I-M-C, which leverage and integrate off-the-shelf foundation models for the OVCD task. The insight behind the M-C-I~framework is to discover all potential changes and then classify these changes, while the insight of I-M-C~framework is to identify all targets of interest and then determine whether their states have changed. Based on these two frameworks, we instantiate to obtain several methods, e.g., SAM-DINOv2-SegEarth-OV, Grounding-DINO-SAM2-DINO, etc. Extensive evaluations on 4 benchmark datasets demonstrate the superior generalization and robustness of our OVCD methods over existing supervised and unsupervised methods. To support continued exploration, we release DynamicEarth, a dedicated codebase designed to advance research and application of OVCD.

Downloads

Published

2026-03-14

How to Cite

Li, K., Cao, X., Deng, Y., Pang, C., Xin, Z., Qiao, H., … Wang, Z. (2026). DynamicEarth: How Far Are We from Open-Vocabulary Change Detection?. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6279–6287. https://doi.org/10.1609/aaai.v40i8.37554

Issue

Section

AAAI Technical Track on Computer Vision V