TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework Using Self-Supervised Multi-Task Learning

Authors

  • Linhao Qu Digital Medical Research Center, School of Basic Medical Science, Fudan University
  • Shaolei Liu Digital Medical Research Center, School of Basic Medical Science, Fudan University
  • Manning Wang Digital Medical Research Center, School of Basic Medical Science, Fudan University
  • Zhijian Song Digital Medical Research Center, School of Basic Medical Science, Fudan University

DOI:

https://doi.org/10.1609/aaai.v36i2.20109

Keywords:

Computer Vision (CV)

Abstract

In this paper, we propose TransMEF, a transformer-based multi-exposure image fusion framework that uses self-supervised multi-task learning. The framework is based on an encoder-decoder network, which can be trained on large natural image datasets and does not require ground truth fusion images. We design three self-supervised reconstruction tasks according to the characteristics of multi-exposure images and conduct these tasks simultaneously using multi-task learning; through this process, the network can learn the characteristics of multi-exposure images and extract more generalized features. In addition, to compensate for the defect in establishing long-range dependencies in CNN-based architectures, we design an encoder that combines a CNN module with a transformer module. This combination enables the network to focus on both local and global information. We evaluated our method and compared it to 11 competitive traditional and deep learning-based methods on the latest released multi-exposure image fusion benchmark dataset, and our method achieved the best performance in both subjective and objective evaluations. Code will be available at https://github.com/miccaiif/TransMEF.

Downloads

Published

2022-06-28

How to Cite

Qu, L., Liu, S., Wang, M., & Song, Z. (2022). TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework Using Self-Supervised Multi-Task Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(2), 2126-2134. https://doi.org/10.1609/aaai.v36i2.20109

Issue

Section

AAAI Technical Track on Computer Vision II