Private Text Generation by Seeding Large Language Model Prompts
Supriya Nagesh, Justin Y. Chen, Nina Mishra, Tal Wagner
TL;DR
This work tackles the challenge of sharing private text data for machine learning by enabling private synthetic text generation through prompting a large language model, without exposing sensitive data. It introduces DP-KPS, which seeds LLM prompts with sequences of privatized keyphrases drawn from a DP KDE over private embeddings, allocating privacy budgets $\varepsilon_{\mathrm{voc}}$ and $\varepsilon_{\mathrm{kde}}$ (with total $\varepsilon_{\mathrm{total}}=\varepsilon_{\mathrm{voc}}+\varepsilon_{\mathrm{kde}}$). The approach combines a privatized vocabulary, high-dimensional DP KDE sampling, and either independent or iterative keyphrase sequence generation, followed by LLM prompting and domain adaptation to produce DP-compliant synthetic texts. Empirical results on MIMIC medical notes and DBPedia-14 demonstrate that DP-KPS can preserve substantial predictive power under DP constraints and outperform certain prompt-based baselines while using far fewer LLM prompts; ablations show the benefits of domain adaptation and few-shot prompting. Overall, DP-KPS offers a practical pathway for privacy-preserving data sharing in regulated domains, enabling downstream ML while acknowledging risks like hallucinations and distribution shifts that require careful validation and governance.
Abstract
We explore how private synthetic text can be generated by suitably prompting a large language model (LLM). This addresses a challenge for organizations like hospitals, which hold sensitive text data like patient medical records, and wish to share it in order to train machine learning models for medical tasks, while preserving patient privacy. Methods that rely on training or finetuning a model may be out of reach, either due to API limits of third-party LLMs, or due to ethical and legal prohibitions on sharing the private data with the LLM itself. We propose Differentially Private Keyphrase Prompt Seeding (DP-KPS), a method that generates a private synthetic text corpus from a sensitive input corpus, by accessing an LLM only through privatized prompts. It is based on seeding the prompts with private samples from a distribution over phrase embeddings, thus capturing the input corpus while achieving requisite output diversity and maintaining differential privacy. We evaluate DP-KPS on downstream ML text classification tasks, and show that the corpora it generates preserve much of the predictive power of the original ones. Our findings offer hope that institutions can reap ML insights by privately sharing data with simple prompts and little compute.
