GSDD: Generative Space Dataset Distillation for Image Super-resolution
DOI:
https://doi.org/10.1609/aaai.v38i7.28534Keywords:
CV: Computational Photography, Image & Video Synthesis, CV: Low Level & Physics-based Vision, CV: Learning & Optimization for CVAbstract
Single image super-resolution (SISR), especially in the real world, usually builds a large amount of LR-HR image pairs to learn representations that contain rich textural and structural information. However, relying on massive data for model training not only reduces training efficiency, but also causes heavy data storage burdens. In this paper, we attempt a pioneering study on dataset distillation (DD) for SISR problems to explore how data could be slimmed and compressed for the task. Unlike previous coreset selection methods which select a few typical examples directly from the original data, we remove the limitation that the selected data cannot be further edited, and propose to synthesize and optimize samples to preserve more task-useful representations. Concretely, by utilizing pre-trained GANs as a suitable approximation of realistic data distribution, we propose GSDD, which distills data in a latent generative space based on GAN-inversion techniques. By optimizing them to match with the practical data distribution in an informative feature space, the distilled data could then be synthesized. Experimental results demonstrate that when trained with our distilled data, GSDD can achieve comparable performance to the state-of-the-art (SOTA) SISR algorithms, while a nearly ×8 increase in training efficiency and a saving of almost 93.2% data storage space can be realized. Further experiments on challenging real-world data also demonstrate the promising generalization ability of GSDD.Downloads
Published
2024-03-24
How to Cite
Zhang, H., Su, S., Zhu, Y., Sun, J., & Zhang, Y. (2024). GSDD: Generative Space Dataset Distillation for Image Super-resolution. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 7069-7077. https://doi.org/10.1609/aaai.v38i7.28534
Issue
Section
AAAI Technical Track on Computer Vision VI