Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization

Francisco Aguilera-Martínez; Fernando Berzal

Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization

Francisco Aguilera-Martínez, Fernando Berzal

TL;DR

The paper addresses privacy leakage from training data in neural networks and critiques gradient-noise based DP approaches like DP-SGD for potentially diminishing utility. It introduces PDP-SGD, a loss-regularization strategy where privacy is enforced through an input- and parameter-dependent regularization term rather than explicit gradient perturbation. The authors argue that the PDP regularization captures DP effects via a proportional input-dependent term that can be integrated with conventional L2 regularization, potentially improving the privacy–utility trade-off. They also highlight possible efficiency gains by avoiding explicit gradient noise and maintaining compatibility with standard SGD optimizers, thereby broadening differential privacy applications in large models and LLMs.

Abstract

Training machine learning models based on neural networks requires large datasets, which may contain sensitive information. The models, however, should not expose private information from these datasets. Differentially private SGD [DP-SGD] requires the modification of the standard stochastic gradient descent [SGD] algorithm for training new models. In this short paper, a novel regularization strategy is proposed to achieve the same goal in a more efficient manner.

Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization

TL;DR

Abstract

Paper Structure (14 sections, 47 equations)

This paper contains 14 sections, 47 equations.

Introduction
Background
Differential Privacy in Deep Neural Networks
Differential Privacy in LLMs: The EW-Tune Framework
User-Level Differential Privacy in LLMs
Differential Privacy through Classic Regularization
A New Perspective on the Differentially Private SGD Algorithm
Differentially Private Regularization
Expectations (and variances)
Linearity of expectations
Non-multiplicativity of expectations
Normal distributions
Product of normal distributions
Square of normal distributions

Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization

TL;DR

Abstract

Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization

Authors

TL;DR

Abstract

Table of Contents