A Study in Dataset Distillation for Image Super-Resolution
Tobias Dietz, Brian B. Moser, Tobias Nauen, Federico Raue, Stanislav Frolov, Andreas Dengel
TL;DR
This paper tackles data efficiency in image super-resolution by applying dataset distillation to a regression task. It adapts gradient matching from Dataset Condensation to SR by replacing the classification loss with reconstruction loss $\ell_{SR}$ and introduces pseudo-label grouping and latent-space distillation using a pretrained generator such as StyleGAN-XL. The experiments show that a distilled dataset occupying about $8.88\%$ of the original size can train SR models with near full-data fidelity, with latent-space distillation outperforming pixel-space methods and generalizing across $2\times$ and $4\times$ upscaling. The work provides a practical path toward memory- and compute-efficient SR pipelines and offers foundational insights for generative data learning in restoration tasks.
Abstract
Dataset distillation aims to compress large datasets into compact yet highly informative subsets that preserve the training behavior of the original data. While this concept has gained traction in classification, its potential for image Super-Resolution (SR) remains largely untapped. In this work, we conduct the first systematic study of dataset distillation for SR, evaluating both pixel- and latent-space formulations. We show that a distilled dataset, occupying only 8.88% of the original size, can train SR models that retain nearly the same reconstruction fidelity as those trained on full datasets. Furthermore, we analyze how initialization strategies and distillation objectives affect efficiency, convergence, and visual quality. Our findings highlight the feasibility of SR dataset distillation and establish foundational insights for memory- and compute-efficient generative restoration models.
