Multi-Level Blur-Aware Stable Diffusion for Region-Adaptive Defocus Deblurring

Authors

  • Xiaopan Li School of Information Engineering, Hubei University of Economics, China
  • Yi Jiang Wuhan Guide Infrared Co., Ltd.
  • Shiqian Wu Institute of Advanced Displays and Imaging, Henan Academy of Sciences, China School of Electronic Information, Wuhan University of Science and Technology, China
  • Shoulie Xie Institute for Infocomm Research, A*STAR, Singapore
  • Sos Agaian College of Staten Island, City University of New York, USA

DOI:

https://doi.org/10.1609/aaai.v40i8.37579

Abstract

Defocus blur, common in shallow depth-of-field photography, varies across image regions and is challenging to accurately estimate and restore. Existing deblurring methods often struggle to capture fine structural textures and do not effectively adapt to regional differences in blur. We propose Multi-Level Blur-Aware Stable Diffusion (MBSD), a novel framework that explicitly integrates regional blur recognition into a diffusion-based image restoration process. MBSD assigns blur-level labels to image patches using a Patch Blur Annotator (PBA), guiding a Multi-Scale Blur Estimator (MSBE) to predict soft blur probabilities and generate routing weights. These weights control a Blur-Adaptive Expert Mixer (BAEM), which adaptively combines features based on local blur severity. The features are then passed to a text-to-image diffusion model via a cross-attention mechanism, enabling region-specific restoration. Extensive experiments on public benchmarks demonstrate that MBSD delivers superior perceptual quality while maintaining competitive PSNR and SSIM, consistently outperforming state-of-the-art methods.

Published

2026-03-14

How to Cite

Li, X., Jiang, Y., Wu, S., Xie, S., & Agaian, S. (2026). Multi-Level Blur-Aware Stable Diffusion for Region-Adaptive Defocus Deblurring. Proceedings of the AAAI Conference on Artificial Intelligence, 40(8), 6504–6512. https://doi.org/10.1609/aaai.v40i8.37579

Issue

Section

AAAI Technical Track on Computer Vision V