DeCompress: Denoising via Neural Compression
Ali Zafari, Xi Chen, Shirin Jalali
TL;DR
DeCompress addresses denoising when ground-truth clean images are unavailable and large paired datasets are impractical. It formulates denoising as learning a neural compression code by minimizing a rate-distortion objective $\mathbb{E}\left[ \frac{1}{n}\|\hat{Y}^n - Y^n\|_2^2 \right] + \lambda R$, where $R = \mathbb{E}[-\log_2 P(\lfloor C^m \rceil)]$ and $\hat{Y}^n = g_s(\lfloor C^m \rceil)$ with bottleneck $C^m = g_a(Y^n)$. Training uses uniform-noise augmentation to approximate quantization and an entropy model to estimate $R$, enabling learning from noisy data only. The results show DeCompress achieving competitive denoising performance against zero-shot baselines while requiring only a single noisy image or limited data, highlighting a practical compression-based paradigm for denoising in data-scarce settings. This approach potentially extends to other inverse problems by coupling neural compression with adaptive rate–distortion control.
Abstract
Learning-based denoising algorithms achieve state-of-the-art performance across various denoising tasks. However, training such models relies on access to large training datasets consisting of clean and noisy image pairs. On the other hand, in many imaging applications, such as microscopy, collecting ground truth images is often infeasible. To address this challenge, researchers have recently developed algorithms that can be trained without requiring access to ground truth data. However, training such models remains computationally challenging and still requires access to large noisy training samples. In this work, inspired by compression-based denoising and recent advances in neural compression, we propose a new compression-based denoising algorithm, which we name DeCompress, that i) does not require access to ground truth images, ii) does not require access to large training dataset - only a single noisy image is sufficient, iii) is robust to overfitting, and iv) achieves superior performance compared with zero-shot or unsupervised learning-based denoisers.
