DANCE: Dual-View Distribution Alignment for Dataset Condensation
Hansong Zhang, Shikun Li, Fanzhao Lin, Weiping Wang, Zhenxing Qian, Shiming Ge
TL;DR
DANCE addresses the efficiency-accuracy gap in dataset condensation by augmenting Distribution Matching with two expert-guided views. The inner-class view employs Pseudo Long-Term Distribution Alignment (PLTDA) using middle encoders to maintain faithful class representations throughout training, while the inter-class view uses Distribution Calibration with expert models to keep synthetic data within real class regions. Empirically, DANCE achieves state-of-the-art results across a range of datasets and resolutions, with notable cross-architecture generalization and improved condensation efficiency. This dual-view framework significantly reduces data burdens for practical, large-scale model development while maintaining competitive performance.
Abstract
Dataset condensation addresses the problem of data burden by learning a small synthetic training set that preserves essential knowledge from the larger real training set. To date, the state-of-the-art (SOTA) results are often yielded by optimization-oriented methods, but their inefficiency hinders their application to realistic datasets. On the other hand, the Distribution-Matching (DM) methods show remarkable efficiency but sub-optimal results compared to optimization-oriented methods. In this paper, we reveal the limitations of current DM-based methods from the inner-class and inter-class views, i.e., Persistent Training and Distribution Shift. To address these problems, we propose a new DM-based method named Dual-view distribution AligNment for dataset CondEnsation (DANCE), which exploits a few pre-trained models to improve DM from both inner-class and inter-class views. Specifically, from the inner-class view, we construct multiple "middle encoders" to perform pseudo long-term distribution alignment, making the condensed set a good proxy of the real one during the whole training process; while from the inter-class view, we use the expert models to perform distribution calibration, ensuring the synthetic data remains in the real class region during condensing. Experiments demonstrate the proposed method achieves a SOTA performance while maintaining comparable efficiency with the original DM across various scenarios. Source codes are available at https://github.com/Hansong-Zhang/DANCE.
