Self-Supervised One-Step Diffusion Refinement for Snapshot Compressive Imaging

Shaoguang Huang; Yunzhen Wang; Haijin Zeng; Hongyu Chen; Hongyan Zhang

doi:10.1609/aaai.v40i7.37423

Authors

Shaoguang Huang School of Computer Science, China University of Geosciences, Wuhan, China
Yunzhen Wang School of Computer Science, China University of Geosciences, Wuhan, China
Haijin Zeng Department of Psychiatry, Harvard University, Cambridge, USA
Hongyu Chen School of Computer Science, China University of Geosciences, Wuhan, China
Hongyan Zhang School of Computer Science, China University of Geosciences, Wuhan, China

DOI:

https://doi.org/10.1609/aaai.v40i7.37423

Abstract

Snapshot compressive imaging (SCI) captures multispectral images (MSIs) using a single coded two-dimensional (2-D) measurement, but reconstructing high-fidelity MSIs from these compressed inputs remains a fundamentally ill-posed challenge. Recent diffusion-based methods improve quality but are limited by scarce MSI training data, domain shifts from RGB-pretrained models, and slow multi-step sampling. These drawbacks restrict their practicality in real-world applications. Unlike prior approaches that rely on expensive iterative refinement or subspace-based diffusion embeddings (e.g., DiffSCI, PSR-SCI)—we introduce a fundamentally different paradigm: a self-supervised One-Step Diffusion (OSD) framework designed specifically for SCI. The key novelty lies in using a single-step diffusion refiner to correct an initial reconstruction, eliminating iterative denoising entirely while preserving generative quality. Moreover, we adopt a self-supervised equivariant learning strategy to train both the predictor and refiner directly from raw 2-D measurements, enabling generalization to unseen domains without ground-truth MSI. To further address limited MSI data, we design a band-selection–driven distillation strategy that transfers core generative priors from large-scale RGB datasets, effectively bridging the domain gap. Extensive experiments confirm that our approach sets a new standard—yielding PSNR gains of 3.44dB, 1.61dB, and 0.28dB on the Harvard, NTIRE, and ICVL datasets respectively, while cutting reconstruction time from 8.9s to just 0.22s per image. These gains in efficiency and adaptability advance SCI reconstruction, enabling accurate and practical real-world deployment.

Self-Supervised One-Step Diffusion Refinement for Snapshot Compressive Imaging

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information