Realism Control One-step Diffusion for Real-world Image Super Resolution

Zongliang Wu; Siming Zheng; Peng-Tao Jiang; Xin Yuan

doi:10.1609/aaai.v40i13.38067

Authors

Zongliang Wu Zhejiang University, Hangzhou, China School of Engineering, Westlake University, Hangzhou, China vivo Mobile Communication Co., Ltd
Siming Zheng vivo Mobile Communication Co., Ltd
Peng-Tao Jiang vivo Mobile Communication Co., Ltd
Xin Yuan School of Engineering, Westlake University, Hangzhou, China

DOI:

https://doi.org/10.1609/aaai.v40i13.38067

Abstract

Pre-trained diffusion models have shown great potential in real-world image super-resolution (Real-ISR) tasks by enabling high-resolution reconstructions. While one-step diffusion (OSD) methods significantly improve efficiency compared to traditional multi-step approaches, they still have limitations in balancing fidelity and realism across diverse scenarios. Since the OSDs for SR are usually trained or distilled by a single timestep, they lack flexible control mechanisms to adaptively prioritize these competing objectives, which are inherently manageable in multi-step methods through adjusting sampling steps. To address this challenge, we propose a Realism Controlled One-step Diffusion (RCOD) framework for Real-ISR. RCOD provides a latent domain grouping strategy that enables explicit control over fidelity-realism trade-offs during the noise prediction phase with minimal training paradigm modifications and original training data. A degradation-aware sampling strategy is also introduced to align distillation regularization with the grouping strategy and enhance the controlling of trade-offs. Moreover, a visual prompt injection module is used to replace conventional text prompts with degradation-aware visual tokens, enhancing both restoration accuracy and semantic consistency. Our method achieves superior fidelity and perceptual quality while maintaining computational efficiency. Extensive experiments demonstrate that RCOD outperforms state-of-the-art OSD methods in both quantitative metrics and visual qualities, with flexible realism control capabilities in the inference stage.

Realism Control One-step Diffusion for Real-world Image Super Resolution

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information