Differential Privacy for Text Analytics via Natural Text Sanitization
Xiang Yue, Minxin Du, Tianhao Wang, Yaliang Li, Huan Sun, Sherman S. M. Chow
TL;DR
This work tackles the challenge of privacy-preserving NLP by moving beyond noisy representations to directly sanitizing text documents. It introduces Utility-optimized MLDP (UMLDP) and token-wise sanitization mechanisms (SanText and SanText+) to replace tokens with semantically similar alternatives under local DP, enabling human-readable privacy guarantees. The authors integrate sanitization-aware pretraining and fine-tuning on sanitized data, demonstrating improved utility and robust privacy against inference attacks across SST-2, MedSTS, and QNLI. Experiments show SanText+ delivers superior utility and efficiency compared with baselines, and sanitization-aware pretraining yields further gains without compromising privacy. Overall, the approach provides a practical, scalable pathway for privacy-preserving NLP pipelines with transparent sanitization and interpretable privacy guarantees.
Abstract
Texts convey sophisticated knowledge. However, texts also convey sensitive information. Despite the success of general-purpose language models and domain-specific mechanisms with differential privacy (DP), existing text sanitization mechanisms still provide low utility, as cursed by the high-dimensional text representation. The companion issue of utilizing sanitized texts for downstream analytics is also under-explored. This paper takes a direct approach to text sanitization. Our insight is to consider both sensitivity and similarity via our new local DP notion. The sanitized texts also contribute to our sanitization-aware pretraining and fine-tuning, enabling privacy-preserving natural language processing over the BERT language model with promising utility. Surprisingly, the high utility does not boost up the success rate of inference attacks.
