WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency (Student Abstract)

Authors

  • Pranav Jeevan Indian Institute of Technology Bombay, Mumbai, India
  • Neeraj Nixon Indian Institute of Technology Bombay, Mumbai, India
  • Amit Sethi Indian Institute of Technology Bombay, Mumbai, India

DOI:

https://doi.org/10.1609/aaai.v39i28.35262

Abstract

Recent advancements in single image super-resolution have been predominantly driven by token mixers and transformer architectures. WaveMixSR utilized the WaveMix architecture, employing a two-dimensional discrete wavelet transform for spatial token mixing, achieving superior performance in super-resolution tasks with remarkable resource efficiency. In this work, we present an enhanced version of the WaveMixSR architecture by (1) replacing the traditional transpose convolution layer with a pixel shuffle operation and (2) implementing a multistage design for higher resolution tasks (4x). Our experiments demonstrate that our enhanced model -- WaveMixSR-V2 -- outperforms other architectures in multiple super-resolution tasks, achieving state-of-the-art for the BSD100 dataset, while also consuming fewer resources and exhibiting higher parameter efficiency and throughput.

Published

2025-04-11

How to Cite

Jeevan, P., Nixon, N., & Sethi, A. (2025). WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 39(28), 29390–29392. https://doi.org/10.1609/aaai.v39i28.35262