StructSR: Refuse Spurious Details in Real-World Image Super-Resolution
DOI:
https://doi.org/10.1609/aaai.v39i5.32532Abstract
Diffusion-based models have shown great promise in real-world image super-resolution (Real-ISR), but often generate content with structural errors and spurious texture details due to the empirical priors and illusions of these models. To address this issue, we introduce StructSR, a simple, effective, and plug-and-play method that enhances structural fidelity and suppresses spurious details for diffusion-based Real-ISR. StructSR operates without the need for additional fine-tuning, external model priors, or high-level semantic knowledge. At its core is the Structure-Aware Screening (SAS) mechanism, which identifies the image with the highest structural similarity to the low-resolution (LR) input in the early inference stage, allowing us to leverage it as a historical structure knowledge to suppress the generation of spurious details. By intervening in the diffusion inference process, StructSR seamlessly integrates with existing diffusion-based Real-ISR models. Our experimental results demonstrate that StructSR significantly improves the fidelity of structure and texture, improving the PSNR and SSIM metrics by an average of 5.27% and 9.36% on a synthetic dataset (DIV2K-Val) and 4.13% and 8.64% on two real-world datasets (RealSR and DRealSR) when integrated with four state-of-the-art diffusion-based Real-ISR methods.Downloads
Published
2025-04-11
How to Cite
Li, Y., Liang, D., Ding, T., & Huang, S.-J. (2025). StructSR: Refuse Spurious Details in Real-World Image Super-Resolution. Proceedings of the AAAI Conference on Artificial Intelligence, 39(5), 5022-5030. https://doi.org/10.1609/aaai.v39i5.32532
Issue
Section
AAAI Technical Track on Computer Vision IV