Pan-Sharpening with Customized Transformer and Invertible Neural Network
Keywords:Computer Vision (CV)
AbstractIn remote sensing imaging systems, pan-sharpening is an important technique to obtain high-resolution multispectral images from a high-resolution panchromatic image and its corresponding low-resolution multispectral image. Owing to the powerful learning capability of convolution neural network (CNN), CNN-based methods have dominated this field. However, due to the limitation of the convolution operator, long-range spatial features are often not accurately obtained, thus limiting the overall performance. To this end, we propose a novel and effective method by exploiting a customized transformer architecture and information-lossless invertible neural module for long-range dependencies modeling and effective feature fusion in this paper. Specifically, the customized transformer formulates the PAN and MS features as queries and keys to encourage joint feature learning across two modalities while the designed invertible neural module enables effective feature fusion to generate the expected pan-sharpened results. To the best of our knowledge, this is the first attempt to introduce transformer and invertible neural network into pan-sharpening field. Extensive experiments over different kinds of satellite datasets demonstrate that our method outperforms state-of-the-art algorithms both visually and quantitatively with fewer parameters and flops. Further, the ablation experiments also prove the effectiveness of the proposed customized long-range transformer and effective invertible neural feature fusion module for pan-sharpening.
How to Cite
Zhou, M., Huang, J., Fang, Y., Fu, X., & Liu, A. (2022). Pan-Sharpening with Customized Transformer and Invertible Neural Network. Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), 3553-3561. https://doi.org/10.1609/aaai.v36i3.20267
AAAI Technical Track on Computer Vision III