SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance

Suzanne Stathatos; Michael Hobley; Pietro Perona; Markus Marks

SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance

Suzanne Stathatos, Michael Hobley, Pietro Perona, Markus Marks

TL;DR

SAVeD tackles denoising in low-SNR video domains (e.g., underwater sonar, ultrasound, microscopy) where clean ground-truth data are unavailable. It introduces a self-supervised framework that amplifies foreground motion through a three-frame reconstruction target and a lightweight encoder–bottleneck–decoder, producing denoised frames without clean targets. A key contribution is the Foreground-to-Background Divergence (FBD) metric, enabling unsupervised evaluation aligned with downstream task performance. The method achieves state-of-the-art gains across classification, detection, tracking, and counting on diverse datasets while requiring fewer training resources than prior denoising approaches, suggesting broad applicability in scientific and medical imaging contexts.

Abstract

Low signal-to-noise ratio videos -- such as those from underwater sonar, ultrasound, and microscopy -- pose significant challenges for computer vision models, particularly when paired clean imagery is unavailable. We present Spatiotemporal Augmentations and denoising in Video for Downstream Tasks (SAVeD), a novel self-supervised method that denoises low-SNR sensor videos using only raw noisy data. By leveraging distinctions between foreground and background motion and exaggerating objects with stronger motion signal, SAVeD enhances foreground object visibility and reduces background and camera noise without requiring clean video. SAVeD has a set of architectural optimizations that lead to faster throughput, training, and inference than existing deep learning methods. We also introduce a new denoising metric, FBD, which indicates foreground-background divergence for detection datasets without requiring clean imagery. Our approach achieves state-of-the-art results for classification, detection, tracking, and counting tasks, and it does so with fewer training resource requirements than existing deep-learning-based denoising methods. Project page: https://suzanne-stathatos.github.io/SAVeD Code page: https://github.com/suzanne-stathatos/SAVeD

SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance

TL;DR

Abstract

SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (15)