ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model

Authors

  • Qi Zang School of Artificial Intelligence, Xidian University, China
  • Jiayi Yang School of Artificial Intelligence, Xidian University, China
  • Shuang Wang School of Artificial Intelligence, Xidian University, China
  • Dong Zhao School of Artificial Intelligence, Xidian University, China
  • Wenjun Yi School of Artificial Intelligence, Xidian University, China
  • Zhun Zhong School of Computer Science and Information Engineering, Hefei University of Technology, China

DOI:

https://doi.org/10.1609/aaai.v39i9.33058

Abstract

Data-driven deep learning models have enabled tremendous progress in change detection (CD) with the support of pixel-level annotations. However, collecting diverse data and manually annotating them is costly, laborious, and knowledge-intensive. Existing generative methods for CD data synthesis show competitive potential in addressing this issue but still face the following limitations: 1) difficulty in flexibly controlling change events, 2) dependence on additional data to train the data generators, 3) focus on specific change detection tasks. To this end, this paper focuses on the semantic CD (SCD) task and develops a multi-temporal SCD data generator ChangeDiff by exploring powerful diffusion models. ChangeDiff innovatively generates change data in two steps: first, it uses text prompts and a text-to-layout (T2L) model to create continuous layouts, and then it employs layout-to-image (L2I) to convert these layouts into images. Specifically, we propose multi-class distribution-guided text prompts (MCDG-TP), allowing for layouts to be generated flexibly through controllable classes and their corresponding ratios. Subsequently, to generalize the T2L model to the proposed MCDG-TP, a class distribution refinement loss is further designed as training supervision. Our generated data shows significant progress in temporal continuity, spatial diversity, and quality realism, empowering change detectors with accuracy and transferability.

Published

2025-04-11

How to Cite

Zang, Q., Yang, J., Wang, S., Zhao, D., Yi, W., & Zhong, Z. (2025). ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model. Proceedings of the AAAI Conference on Artificial Intelligence, 39(9), 9763–9771. https://doi.org/10.1609/aaai.v39i9.33058

Issue

Section

AAAI Technical Track on Computer Vision VIII