DynamicEarth: How Far Are We from Open-Vocabulary Change Detection?
DOI:
https://doi.org/10.1609/aaai.v40i8.37554Abstract
Monitoring Earth's evolving land covers requires methods capable of detecting changes across a wide range of categories and contexts. Existing change detection methods are hindered by their dependency on predefined classes, reducing their effectiveness in open-world applications. To address this issue, we introduce open-vocabulary change detection (OVCD), a novel task that bridges vision and language to detect changes across any category. Considering the lack of high-quality data and annotation, we propose two training-free frameworks, M-C-I and I-M-C, which leverage and integrate off-the-shelf foundation models for the OVCD task. The insight behind the M-C-I~framework is to discover all potential changes and then classify these changes, while the insight of I-M-C~framework is to identify all targets of interest and then determine whether their states have changed. Based on these two frameworks, we instantiate to obtain several methods, e.g., SAM-DINOv2-SegEarth-OV, Grounding-DINO-SAM2-DINO, etc. Extensive evaluations on 4 benchmark datasets demonstrate the superior generalization and robustness of our OVCD methods over existing supervised and unsupervised methods. To support continued exploration, we release DynamicEarth, a dedicated codebase designed to advance research and application of OVCD.Published
2026-03-14
How to Cite
Li, K., Cao, X., Deng, Y., Pang, C., Xin, Z., Qiao, H., … Wang, Z. (2026). DynamicEarth: How Far Are We from Open-Vocabulary Change Detection?. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6279–6287. https://doi.org/10.1609/aaai.v40i8.37554
Issue
Section
AAAI Technical Track on Computer Vision V