NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-World Video Super-Resolution
Yexing Song, Meilin Wang, Zhijing Yang, Xiaoyu Xian, Yukai Shi
TL;DR
NegVSR addresses real-world video super-resolution under unknown degradation by modeling generalized noise with sequential noise sequences and negative augmentations. It introduces NegMix, which combines real-world noise with LR frames and applies patch-based rotation; it uses Augmented Positive and Augmented Negative guidance losses to enforce consistency and denoise robustness. The approach expands the degradation domain beyond classical kernels and demonstrates superior performance on VideoLQ and FLIR datasets, with ablations validating the components. The work highlights the importance of preserving sequential noise structure in video VSR and proposes a practical, out-of-distribution noise modeling framework. It also notes potential improvements in inference speed with lightweight architectures.
Abstract
The capability of video super-resolution (VSR) to synthesize high-resolution (HR) video from ideal datasets has been demonstrated in many works. However, applying the VSR model to real-world video with unknown and complex degradation remains a challenging task. First, existing degradation metrics in most VSR methods are not able to effectively simulate real-world noise and blur. On the contrary, simple combinations of classical degradation are used for real-world noise modeling, which led to the VSR model often being violated by out-of-distribution noise. Second, many SR models focus on noise simulation and transfer. Nevertheless, the sampled noise is monotonous and limited. To address the aforementioned problems, we propose a Negatives augmentation strategy for generalized noise modeling in Video Super-Resolution (NegVSR) task. Specifically, we first propose sequential noise generation toward real-world data to extract practical noise sequences. Then, the degeneration domain is widely expanded by negative augmentation to build up various yet challenging real-world noise sets. We further propose the augmented negative guidance loss to learn robust features among augmented negatives effectively. Extensive experiments on real-world datasets (e.g., VideoLQ and FLIR) show that our method outperforms state-of-the-art methods with clear margins, especially in visual quality. Project page is available at: https://negvsr.github.io/.
