Locally Differentially Private In-Context Learning
Chunyan Zheng, Keke Sun, Wenhao Zhao, Haibo Zhou, Lixin Jiang, Shaoyang Song, Chunlai Zhou
TL;DR
This work addresses privacy risks in in-context learning by LLMs when demonstrations may contain sensitive labels. It introduces LDP-ICL, a locally differentially private framework that perturbs demonstration labels with a $k$-ary randomized response and analyzes the resulting privacy–utility trade-off via a gradient-descent perspective on ICL. The authors derive a test-prediction formula under noisy demonstrations, connect ICL to linear self-attention, and extend the framework to discrete distribution estimation. Empirical results across multiple datasets show that LDP-ICL can closely match non-private ICL for moderate privacy budgets while providing strong privacy guarantees, and that it can outperform Warner’s mechanism in high-privacy regimes for distribution estimation. The work also provides templates, proofs, and extensive ablations to support the proposed approach and outlines future directions for refining demonstration selection and privatizing additional components of the ICL pipeline.
Abstract
Large pretrained language models (LLMs) have shown surprising In-Context Learning (ICL) ability. An important application in deploying large language models is to augment LLMs with a private database for some specific task. The main problem with this promising commercial use is that LLMs have been shown to memorize their training data and their prompt data are vulnerable to membership inference attacks (MIA) and prompt leaking attacks. In order to deal with this problem, we treat LLMs as untrusted in privacy and propose a locally differentially private framework of in-context learning(LDP-ICL) in the settings where labels are sensitive. Considering the mechanisms of in-context learning in Transformers by gradient descent, we provide an analysis of the trade-off between privacy and utility in such LDP-ICL for classification. Moreover, we apply LDP-ICL to the discrete distribution estimation problem. In the end, we perform several experiments to demonstrate our analysis results.
