Deep Classifier Mimicry without Data Access
Steven Braun, Martin Mundt, Kristian Kersting
TL;DR
CAKE addresses the challenge of distilling knowledge from a trained teacher without access to the original training data by generating synthetic pairs and diffusing them toward the teacher's decision boundary using a contrastive objective. The method is model-agnostic and incorporates noise and data priors to ensure broad boundary coverage, enabling effective student learning across architectures and scales. Ablation, cross-model, and cross-dataset experiments show CAKE achieves competitive or superior performance relative to data-free baselines while approaching data-dependent baselines with a modest gap. This approach has practical implications for privacy-preserving model updates and cross-domain knowledge transfer, though it also raises concerns about potential misuse and biases in transferable knowledge.
Abstract
Access to pre-trained models has recently emerged as a standard across numerous machine learning domains. Unfortunately, access to the original data the models were trained on may not equally be granted. This makes it tremendously challenging to fine-tune, compress models, adapt continually, or to do any other type of data-driven update. We posit that original data access may however not be required. Specifically, we propose Contrastive Abductive Knowledge Extraction (CAKE), a model-agnostic knowledge distillation procedure that mimics deep classifiers without access to the original data. To this end, CAKE generates pairs of noisy synthetic samples and diffuses them contrastively toward a model's decision boundary. We empirically corroborate CAKE's effectiveness using several benchmark datasets and various architectural choices, paving the way for broad application.
