Differentially Private Two-Stage Empirical Risk Minimization and Applications to Individualized Treatment Rule

Joowon Lee; Guanhua Chen

Differentially Private Two-Stage Empirical Risk Minimization and Applications to Individualized Treatment Rule

Joowon Lee, Guanhua Chen

TL;DR

This work addresses end-to-end privacy in two-stage empirical risk minimization where a data-dependent first stage (covariate-balancing weights) feeds into a second-stage weighted ERM. The authors propose DP-2ERM, which preserves first-stage weights non-privately and applies tailored objective perturbation noise to the second stage, enabling end-to-end ${\\epsilon}$-DP or $(\\epsilon,\\delta)$-DP guarantees. They develop deterministic stability bounds for weight perturbations across IPW, MMD, and EBW schemes, along with probabilistic sensitivity bounds for the private estimator, yielding favorable privacy-utility trade-offs compared with standard composition. Substantial theoretical contributions include perturbation bounds, stability results, and a utility analysis showing near-optimal private performance in simulations and real-data ITR applications. Empirically, DP-2ERM with entropy balancing and kernel-based weights demonstrates notable improvements over baselines, underscoring the practical impact for private individualized treatment rule learning and related causal-inference pipelines.

Abstract

Differential Privacy (DP) provides a rigorous framework for deriving privacy-preserving estimators by injecting calibrated noise to mask individual contributions while preserving population-level insights. Its central challenge lies in the privacy-utility trade-off: calibrating noise levels to ensure robust protection without compromising statistical performance. Standard DP methods struggle with a particular class of two-stage problems prevalent in individualized treatment rules (ITRs) and causal inference. In these settings, data-dependent weights are first computed to satisfy distributional constraints, such as covariate balance, before the final parameter of interest is estimated. Current DP approaches often privatize stages independently, which either degrades weight efficacy-leading to biased and inconsistent estimates-or introduces excessive noise to account for worst-case scenarios. To address these challenges, we propose the Differentially Private Two-Stage Empirical Risk Minimization (DP-2ERM), a framework that injects a carefully calibrated noise only into the second stage while maintaining privacy for the entire pipeline and preserving the integrity of the first stage weights. Our theoretical contributions include deterministic bounds on weight perturbations across various widely used weighting methods, and probabilistic bounds on sensitivity for the final estimator. Simulations and real-world applications in ITR demonstrate that DP-2ERM significantly enhances utility over existing methods while providing rigorous privacy guarantees.

Differentially Private Two-Stage Empirical Risk Minimization and Applications to Individualized Treatment Rule

TL;DR

-DP or

-DP guarantees. They develop deterministic stability bounds for weight perturbations across IPW, MMD, and EBW schemes, along with probabilistic sensitivity bounds for the private estimator, yielding favorable privacy-utility trade-offs compared with standard composition. Substantial theoretical contributions include perturbation bounds, stability results, and a utility analysis showing near-optimal private performance in simulations and real-data ITR applications. Empirically, DP-2ERM with entropy balancing and kernel-based weights demonstrates notable improvements over baselines, underscoring the practical impact for private individualized treatment rule learning and related causal-inference pipelines.

Abstract

Paper Structure (33 sections, 19 theorems, 44 equations, 2 figures, 1 table, 1 algorithm)

This paper contains 33 sections, 19 theorems, 44 equations, 2 figures, 1 table, 1 algorithm.

Introduction
Differential Privacy and Weighted Empirical Risk Minimization
Differentially Private Two-Stage ERM
Contributions
Background and Related Work
Differential Privacy
Differentially Private Empirical Risk Minimization
Covariate Balancing Weights and ITR
Differentially Private Causal Inference
More on Limitations on Using Composition Theorems for 2ERM
Statement of Results
Notations
Problem Formulation: Two-Stage ERM
Proposed Method: The DP-2ERM Algorithm
General Results on DP-2ERM
...and 18 more sections

Key Result

Theorem 3.1

Fix size-$n$ datasets $\mathcal{D}$ and $\mathcal{D}'$ that differ by at most one point and let $\mathbf{w}=(w_{1},\dots,w_{n})$ and $\mathbf{w}'=(w_{1}',\dots,w_{n}')$ be the non-private optimal sample weights solving eq:find_optimal_weight for the datasets $\mathcal{D}$ and $\mathcal{D}'$, respect Suppose there exist $\lambda,\zeta$ such that $\textup{tr}(\nabla^{2} \ell(\boldsymbol{\theta};\mat

Figures (2)

Figure 1: Differentially Private Two-Stage ERM Framework. Our approach maintains non-private weight computation in the first stage to effectively address covariate shift, while applying privacy mechanisms exclusively to the second stage where weighted ERM is solved. This design is critical because standard composition theorems require calibrating noise against worst-case sequential dependencies to guarantee privacy, resulting in excessive noise injection that destroys utility. Our method ensures end-to-end differential privacy while preserving the effectiveness of the weights for balancing covariates.
Figure 2: Privacy--utility tradeoff across scenarios. Average test-set ITR accuracy (100 replicates) for DP-2ERM with IPW (blue), EBW (green), and MMD weights (red) under Gamma objective perturbation (top; $\epsilon$-DP) and Gaussian objective perturbation (bottom; $(\epsilon,\delta)$-DP). Solid squares denote the strong-composition baseline; dashed lines denote the non-private linear-ITR baseline within the chosen decision-function class.

Theorems & Definitions (20)

Definition 2.1: Differential Privacy
Theorem 3.1: Data Sensitivity Bound for DP-2ERM via Objective Perturbation
Theorem 3.2: Differential Privacy of DP-2ERM
Corollary 3.3: Differential Privacy of Data-Independent Weighted ERM
Corollary 3.4: Universal Differential Privacy of 2ERM via Objective Perturbation
Theorem 3.5: Utility Guarantees for DP-2ERM
Theorem 4.1: General Stability of Parameterized Weights
Theorem 4.2: Stability of Covariate Balancing Weights
Corollary 4.3: Improved Privacy Guarantees of DP-2ERM with Covariate Balancing Weights
Proposition 4.4: Stability of IPWs under Randomized Trial
...and 10 more

Differentially Private Two-Stage Empirical Risk Minimization and Applications to Individualized Treatment Rule

TL;DR

Abstract

Differentially Private Two-Stage Empirical Risk Minimization and Applications to Individualized Treatment Rule

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (20)