NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-world Video Super-Resolution

Authors

  • Yexing Song Guangdong University of Technology
  • Meilin Wang Guangdong University of Technology
  • Zhijing Yang Guangdong University of Technology
  • Xiaoyu Xian CRRC Academy
  • Yukai Shi Guangdong University of Technology

DOI:

https://doi.org/10.1609/aaai.v38i9.28942

Keywords:

KRR: Argumentation, CV: Low Level & Physics-based Vision, ML: Transfer, Domain Adaptation, Multi-Task Learning, ML: Unsupervised & Self-Supervised Learning

Abstract

The capability of video super-resolution (VSR) to synthesize high-resolution (HR) video from ideal datasets has been demonstrated in many works. However, applying the VSR model to real-world video with unknown and complex degradation remains a challenging task. First, existing degradation metrics in most VSR methods are not able to effectively simulate real-world noise and blur. On the contrary, simple combinations of classical degradation are used for real-world noise modeling, which led to the VSR model often being violated by out-of-distribution noise. Second, many SR models focus on noise simulation and transfer. Nevertheless, the sampled noise is monotonous and limited. To address the aforementioned problems, we propose a Negatives augmentation strategy for generalized noise modeling in Video Super-Resolution (NegVSR) task. Specifically, we first propose sequential noise generation toward real-world data to extract practical noise sequences. Then, the degeneration domain is widely expanded by negative augmentation to build up various yet challenging real-world noise sets. We further propose the augmented negative guidance loss to learn robust features among augmented negatives effectively. Extensive experiments on real-world datasets (e.g., VideoLQ and FLIR) show that our method outperforms state-of-the-art methods with clear margins, especially in visual quality. Project page is available at: https://negvsr.github.io/.

Published

2024-03-24

How to Cite

Song, Y., Wang, M., Yang, Z., Xian, X., & Shi, Y. (2024). NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-world Video Super-Resolution. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10705-10713. https://doi.org/10.1609/aaai.v38i9.28942

Issue

Section

AAAI Technical Track on Knowledge Representation and Reasoning