SDAT: Sub-Dataset Alternation Training for Improved Image Demosaicing
Yuval Becker, Raz Z. Nossek, Tomer Peleg
TL;DR
SDAT introduces Sub-Dataset Alternation Training to reduce dataset-induced bias in image demosaicing by alternating learning between biased sub-datasets and the full dataset. The method identifies diverse sub-datasets using artifact metrics and selects training phases based on the minimum average sub-dataset validation loss, formalized as $\bar{V}_t = \frac{1}{N}\sum_{i=1}^{N} V_{w_t,c_i}$. Across both low- and high-capacity architectures, including CNNs and transformers, SDAT yields consistent performance gains and achieves state-of-the-art results on three popular demosaicing benchmarks. The approach emphasizes data-centric training dynamics, demonstrating that curated bias diversity in sub-datasets can enhance generalization, with practical implications for edge devices and broader image restoration tasks.
Abstract
Image demosaicing is an important step in the image processing pipeline for digital cameras. In data centric approaches, such as deep learning, the distribution of the dataset used for training can impose a bias on the networks' outcome. For example, in natural images most patches are smooth, and high-content patches are much rarer. This can lead to a bias in the performance of demosaicing algorithms. Most deep learning approaches address this challenge by utilizing specific losses or designing special network architectures. We propose a novel approach, SDAT, Sub-Dataset Alternation Training, that tackles the problem from a training protocol perspective. SDAT is comprised of two essential phases. In the initial phase, we employ a method to create sub-datasets from the entire dataset, each inducing a distinct bias. The subsequent phase involves an alternating training process, which uses the derived sub-datasets in addition to training also on the entire dataset. SDAT can be applied regardless of the chosen architecture as demonstrated by various experiments we conducted for the demosaicing task. The experiments are performed across a range of architecture sizes and types, namely CNNs and transformers. We show improved performance in all cases. We are also able to achieve state-of-the-art results on three highly popular image demosaicing benchmarks.
