Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain

Authors

  • Xuanhua He Hefei Institutes of Physical Science, Chinese Academy of Sciences University of Science and Technology of China
  • Tao Hu Hefei Institutes of Physical Science, Chinese Academy of Sciences University of Science and Technology of China
  • Guoli Wang Horizon Robotics
  • Zejin Wang Horizon Robotics
  • Run Wang Horizon Robotics
  • Qian Zhang Horizon Robotics
  • Keyu Yan Hefei Institutes of Physical Science, Chinese Academy of Sciences University of Science and Technology of China
  • Ziyi Chen Tencent Technology
  • Rui Li Hefei Institutes of Physical Science, Chinese Academy of Sciences
  • Chengjun Xie Hefei Institutes of Physical Science, Chinese Academy of Sciences
  • Jie Zhang Hefei Institutes of Physical Science, Chinese Academy of Sciences
  • Man Zhou Nanyang Technological University

DOI:

https://doi.org/10.1609/aaai.v38i3.27985

Keywords:

CV: Low Level & Physics-based Vision, CV: Computational Photography, Image & Video Synthesis

Abstract

RAW to sRGB mapping, which aims to convert RAW images from smartphones into RGB form equivalent to that of Digital Single-Lens Reflex (DSLR) cameras, has become an important area of research. However, current methods often ignore the difference between cell phone RAW images and DSLR camera RGB images, a difference that goes beyond the color matrix and extends to spatial structure due to resolution variations. Recent methods directly rebuild color mapping and spatial structure via shared deep representation, limiting optimal performance. Inspired by Image Signal Processing (ISP) pipeline, which distinguishes image restoration and enhancement, we present a novel Neural ISP framework, named FourierISP. This approach breaks the image down into style and structure within the frequency domain, allowing for independent optimization. FourierISP is comprised of three subnetworks: Phase Enhance Subnet for structural refinement, Amplitude Refine Subnet for color learning, and Color Adaptation Subnet for blending them in a smooth manner. This approach sharpens both color and structure, and extensive evaluations across varied datasets confirm that our approach realizes state-of-the-art results. Code will be available at https://github.com/alexhe101/FourierISP.

Published

2024-03-24

How to Cite

He, X., Hu, T., Wang, G., Wang, Z., Wang, R., Zhang, Q., … Zhou, M. (2024). Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain. Proceedings of the AAAI Conference on Artificial Intelligence, 38(3), 2130–2138. https://doi.org/10.1609/aaai.v38i3.27985

Issue

Section

AAAI Technical Track on Computer Vision II