Table of Contents
Fetching ...

DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer

Junyuan Hong, Jiachen T. Wang, Chenhui Zhang, Zhangheng Li, Bo Li, Zhangyang Wang

TL;DR

This paper introduces the first private prompt generation mechanism, by a differentially-private (DP) ensemble of in-context learning with private demonstrations, and demonstrates that prompts suggested by LLMs themselves can be transferred without compromising performance significantly.

Abstract

Large Language Models (LLMs) have emerged as dominant tools for various tasks, particularly when tailored for a specific target by prompt tuning. Nevertheless, concerns surrounding data privacy present obstacles due to the tuned prompts' dependency on sensitive private information. A practical solution is to host a local LLM and optimize a soft prompt privately using data. Yet, hosting a local model becomes problematic when model ownership is protected. Alternative methods, like sending data to the model's provider for training, intensify these privacy issues facing an untrusted provider. In this paper, we present a novel solution called Differentially-Private Offsite Prompt Tuning (DP-OPT) to address this challenge. Our approach involves tuning a discrete prompt on the client side and then applying it to the desired cloud models. We demonstrate that prompts suggested by LLMs themselves can be transferred without compromising performance significantly. To ensure that the prompts do not leak private information, we introduce the first private prompt generation mechanism, by a differentially-private (DP) ensemble of in-context learning with private demonstrations. With DP-OPT, generating privacy-preserving prompts by Vicuna-7b can yield competitive performance compared to non-private in-context learning on GPT3.5 or local private prompt tuning. Codes are available at https://github.com/VITA-Group/DP-OPT .

DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer

TL;DR

This paper introduces the first private prompt generation mechanism, by a differentially-private (DP) ensemble of in-context learning with private demonstrations, and demonstrates that prompts suggested by LLMs themselves can be transferred without compromising performance significantly.

Abstract

Large Language Models (LLMs) have emerged as dominant tools for various tasks, particularly when tailored for a specific target by prompt tuning. Nevertheless, concerns surrounding data privacy present obstacles due to the tuned prompts' dependency on sensitive private information. A practical solution is to host a local LLM and optimize a soft prompt privately using data. Yet, hosting a local model becomes problematic when model ownership is protected. Alternative methods, like sending data to the model's provider for training, intensify these privacy issues facing an untrusted provider. In this paper, we present a novel solution called Differentially-Private Offsite Prompt Tuning (DP-OPT) to address this challenge. Our approach involves tuning a discrete prompt on the client side and then applying it to the desired cloud models. We demonstrate that prompts suggested by LLMs themselves can be transferred without compromising performance significantly. To ensure that the prompts do not leak private information, we introduce the first private prompt generation mechanism, by a differentially-private (DP) ensemble of in-context learning with private demonstrations. With DP-OPT, generating privacy-preserving prompts by Vicuna-7b can yield competitive performance compared to non-private in-context learning on GPT3.5 or local private prompt tuning. Codes are available at https://github.com/VITA-Group/DP-OPT .
Paper Structure (20 sections, 4 theorems, 4 equations, 6 figures, 12 tables, 4 algorithms)

This paper contains 20 sections, 4 theorems, 4 equations, 6 figures, 12 tables, 4 algorithms.

Key Result

Theorem A.1

The exponential mechanism is ${\epsilon}$-DP, and $(\alpha, {\epsilon}_{EM}(\alpha))$-RDP s.t. ${\epsilon}_{EM}(\alpha) := \frac{\alpha}{2} {\epsilon}^2$.

Figures (6)

  • Figure 1: Differentially-Private Offsite Prompt Tuning (DP-OPT) works as an intermediate layer between local data and cloud models. Leveraging a local model, DP-OPT can fine-tune a differentially-private prompt that can transfer to the target model.
  • Figure 2: A DLN-1-generated prompt is coherent but suffers from privacy leakage. We highlight the potential leakage in the prompt and semantically-nearest $\blacktriangleright$ [leaked sample] from the training set.
  • Figure 3: Examples of Generated Prompts. OPT/DP-OPT tends to generate pseudo examples (blue text) which do not belong to the training set. We highlight potentially-leaked samples and semantically-nearest retrieved $\blacktriangleright$ [training samples] (there might be multiple such samples). More examples are in \ref{['tab:generated_prompts']}.
  • Figure 4: Privacy-utility trade-off of DP-OPT. Smaller $\epsilon$ indicates stricter privacy protection.
  • Figure 5: Templates for DLN-1, OPT and DP-OPT.
  • ...and 1 more figures

Theorems & Definitions (11)

  • Definition 3.1: Differential Privacy dwork2006calibrating
  • Definition A.1: Rényi Differential Privacy mironov2017renyi
  • Definition A.2: Approximate RDP bun2016concentratedzhu2022adaptive
  • Definition A.3: Exponential Mechanism mcsherry2007mechanism
  • Theorem A.1: bun2016concentrated
  • Theorem A.2
  • proof
  • Theorem A.3
  • proof
  • Definition A.4: Monotonic Utility Function
  • ...and 1 more