MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models

Jeffrey A. Chan-Santiago; Praveen Tirupattur; Gaurav Kumar Nayak; Gaowen Liu; Mubarak Shah

MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models

Jeffrey A. Chan-Santiago, Praveen Tirupattur, Gaurav Kumar Nayak, Gaowen Liu, Mubarak Shah

TL;DR

This work tackles dataset distillation by extracting diverse, representative samples from a fixed, pre-trained diffusion model without distillation-loss fine-tuning. It introduces Mode Discovery to identify data modes, Mode Guidance to steer sampling toward each mode during the diffusion denoising process, and Stop Guidance to preserve sample quality while maintaining diversity. The approach achieves state-of-the-art accuracy across ImageNette, ImageIDC, ImageNet-100, and ImageNet-1K under hard- and soft-label protocols, while significantly reducing computational cost compared with fine-tuning-based methods. It further demonstrates compatibility with multiple diffusion backbones, including text-to-image models, broadening practical deployments for efficient dataset distillation in resource-constrained settings.

Abstract

Dataset distillation has emerged as an effective strategy, significantly reducing training costs and facilitating more efficient model deployment. Recent advances have leveraged generative models to distill datasets by capturing the underlying data distribution. Unfortunately, existing methods require model fine-tuning with distillation losses to encourage diversity and representativeness. However, these methods do not guarantee sample diversity, limiting their performance. We propose a mode-guided diffusion model leveraging a pre-trained diffusion model without the need to fine-tune with distillation losses. Our approach addresses dataset diversity in three stages: Mode Discovery to identify distinct data modes, Mode Guidance to enhance intra-class diversity, and Stop Guidance to mitigate artifacts in synthetic samples that affect performance. Our approach outperforms state-of-the-art methods, achieving accuracy gains of 4.4%, 2.9%, 1.6%, and 1.6% on ImageNette, ImageIDC, ImageNet-100, and ImageNet-1K, respectively. Our method eliminates the need for fine-tuning diffusion models with distillation losses, significantly reducing computational costs. Our code is available on the project webpage: https://jachansantiago.github.io/mode-guided-distillation/

MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models

TL;DR

Abstract

MGD$^3$: Mode-Guided Dataset Distillation using Diffusion Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)