Cascading Refinement Video Denoising with Uncertainty Adaptivity
Xinyuan Yu
TL;DR
The paper tackles video denoising under realistic, multi-level noise where alignment accuracy is critical for restoration. It introduces a cascading refinement framework that jointly refines optical-flow-based alignment and frame restoration, augmented by an uncertainty map after each iteration to guide early stopping. Key innovations include pre-denoising with patch matching, a RAFT-inspired iterative flow estimator with pyramid correlation, a flow-guided deformable convolution-based reconstruction network, and an uncertainty-adaptive loss that reduces computation while achieving state-of-the-art results on the CRVD dataset. The approach enhances robustness to varying noise levels and offers practical gains for real-world video analysis and downstream tasks.
Abstract
Accurate alignment is crucial for video denoising. However, estimating alignment in noisy environments is challenging. This paper introduces a cascading refinement video denoising method that can refine alignment and restore images simultaneously. Better alignment enables restoration of more detailed information in each frame. Furthermore, better image quality leads to better alignment. This method has achieved SOTA performance by a large margin on the CRVD dataset. Simultaneously, aiming to deal with multi-level noise, an uncertainty map was created after each iteration. Because of this, redundant computation on the easily restored videos was avoided. By applying this method, the entire computation was reduced by 25% on average.
