Differential Privacy in Continual Learning: Which Labels to Update?
Marlon Tobaben, Talal Alrawajfeh, Marcus Klasson, Mikko Heikkilä, Arno Solin, Antti Honkela
TL;DR
This work exposes a privacy vulnerability in differential-privacy–enabled continual learning: releasing the classifier's output space can leak information about sensitive data. It formalizes task-wise DP for continual learning and proposes two data-independent strategies for the output label space—Sprior and Slearned—to preserve DP guarantees while maintaining utility. The authors instantiate DP-CL with pre-trained-model approaches, developing a Cosine Similarity Classifier and a PEFT Ensemble, and demonstrate robust performance across varied priors, blurry tasks, and domain shifts. The results offer practical guidance for privacy-preserving CL, enabling effective use of large label taxonomies and pre-trained representations under DP constraints.
Abstract
The goal of continual learning (CL) is to retain knowledge across tasks, but this conflicts with strict privacy required for sensitive training data that prevents storing or memorising individual samples. To address that, we combine CL and differential privacy (DP). We highlight that failing to account for privacy leakage through the set of labels a model can output can break the privacy of otherwise valid DP algorithms. This is especially relevant in CL. We show that mitigating the issue with a data-independent overly large label space can have minimal negative impact on utility when fine-tuning a pre-trained model under DP, while learning the labels with a separate DP mechanism risks losing small classes.
