Deep Unfolded Network with Intrinsic Supervision for Pan-Sharpening

Authors

  • Hebaixu Wang Wuhan University
  • Meiqi Gong Wuhan University
  • Xiaoguang Mei Wuhan University
  • Hao Zhang Wuhan University
  • Jiayi Ma Wuhan University

DOI:

https://doi.org/10.1609/aaai.v38i6.28350

Keywords:

CV: Multi-modal Vision, CV: Computational Photography, Image & Video Synthesis

Abstract

Existing deep pan-sharpening methods lack the learning of complementary information between PAN and MS modalities in the intermediate layers, and exhibit low interpretability due to their black-box designs. To this end, an interpretable deep unfolded network with intrinsic supervision for pan-sharpening is proposed. Building upon the observation degradation process, it formulates the pan-sharpening task as a variational model minimization with spatial consistency prior and spectral projection prior. The former prior requires a joint component decomposition of PAN and MS images to extract intrinsic features. By being supervised in the intermediate layers, it can selectively provide high-frequency information for spatial enhancement. The latter prior constrains the intensity correlation between MS and PAN images derived from physical observations, so as to improve spectral fidelity. To further enhance the transparency of network design, we develop an iterative solution algorithm following the half-quadratic splitting to unfold the deep model. It rigorously adheres to the variational model, significantly enhancing the interpretability behind network design and efficiently alternating the optimization of the network. Extensive experiments demonstrate the advantages of our method compared to state-of-the-arts, showcasing its remarkable generalization capability to real-world scenes. Our code is publicly available at https://github.com/Baixuzx7/DISPNet.

Published

2024-03-24

How to Cite

Wang, H., Gong, M., Mei, X., Zhang, H., & Ma, J. (2024). Deep Unfolded Network with Intrinsic Supervision for Pan-Sharpening. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5419-5426. https://doi.org/10.1609/aaai.v38i6.28350

Issue

Section

AAAI Technical Track on Computer Vision V