Differentially Private In-context Learning via Sampling Few-shot Mixed with Zero-shot Outputs
James Flemings, Haosheng Gan, Hongyi Li, Meisam Razaviyayn, Murali Annavaram
TL;DR
This work tackles privacy leakage in in-context learning by enabling differentially private generation without added noise. It introduces DPS-MOZO, a decoding framework that mixes subsampled one-shot demonstrations with a zero-shot distribution via adaptive lambda parameters to bound Renyi divergence and achieve DP. By operating in online (On) or offline (Off) modes, DPS-MOZO provides strong privacy guarantees (e.g., $\epsilon=2$) with minimal utility loss on multiple generation benchmarks, often surpassing existing DP-ICL baselines. The approach leverages privacy amplification by subsampling and the post-processing property of DP, offering a practical path toward privacy-preserving LLM-based ICL in real-world deployment.
Abstract
In-context learning (ICL) has shown promising improvement in downstream task adaptation of LLMs by augmenting prompts with relevant input-output examples (demonstrations). However, the ICL demonstrations can contain privacy-sensitive information, which can be leaked and/or regurgitated by the LLM output. Differential Privacy (DP), a widely adopted privacy safeguard, has emerged to mitigate this privacy leakage, with recent work demonstrating strong privacy-utility tradeoffs in classification tasks for ICL. However, generation tasks for ICL are challenging due to the high-dimensional output space of open-ended generation. To this end, we propose $\texttt{dps-mozo}$, Differentially Private Sampling by Mixing One-shot with Zero-shot Outputs, a decoding framework that generates DP text by sampling from the product of multiple one-shot outputs mixed with a zero-shot output. This mixing effectively reduces the amount of information that can be leaked by each demonstration. By utilizing the inherent randomness in sampling from the mixed distributions, we can achieve DP without adding noise, thereby improving the privacy-utility tradeoff. Our experimental evaluations show $\texttt{dps-mozo}$ can achieve a strong privacy guarantee, $ε=2$, with minimal utility degradation compared to non-private few-shot learning, $\textbf{0.3}$% ROUGE-L F1 score decrease on the SAMSum dataset with Gemma 2 2B.
