Defending Diffusion Models Against Membership Inference Attacks via Higher-Order Langevin Dynamics

Benjamin Sterling; Yousef El-Laham; Mónica F. Bugallo

TL;DR

This work addresses membership inference attacks on diffusion models. It proposes a defense based on critically-damped higher-order Langevin dynamics (HOLD++), which adds auxiliary variables to the forward diffusion so that randomness is injected early in the process. A theoretical analysis shows that HOLD++ achieves Rényi differential privacy, with bounds that depend on the initial variance parameter $\epsilon_{\text{num}}$ and the model order; the defense also relies on the non-deterministic score to further deter attacks. Empirically, the authors validate the approach on a toy dataset and on LJ Speech, showing that higher model orders $n$ and larger variance factors $\beta$ reduce membership leakage as measured by AUROC, with FID used to assess sample quality. The results suggest a favorable privacy-utility trade-off for HOLD++ compared to standard DP approaches, and the authors release code for reproducibility.
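
For readers unfamiliar with the privacy notion named above, the standard definition of Rényi differential privacy is recalled here. This is the general textbook form, not the paper's specific bound; the symbols $\alpha$, $\varepsilon$, $\mathcal{M}$, $D$, and $D'$ are notation assumed for this note, and $\varepsilon$ is unrelated to $\epsilon_{\text{num}}$.

$$
D_\alpha(P \,\|\, Q) = \frac{1}{\alpha - 1} \log \mathbb{E}_{x \sim Q}\!\left[\left(\frac{P(x)}{Q(x)}\right)^{\alpha}\right], \qquad \alpha > 1
$$

$$
\mathcal{M} \text{ is } (\alpha, \varepsilon)\text{-RDP} \iff D_\alpha\big(\mathcal{M}(D) \,\|\, \mathcal{M}(D')\big) \le \varepsilon \ \text{ for all adjacent datasets } D, D'
$$

A smaller $\varepsilon$ means the output distributions on adjacent datasets are harder to tell apart, which is what limits membership inference.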

Abstract

Recent advances in generative artificial intelligence applications have raised new data security concerns. This paper focuses on defending diffusion models against membership inference attacks, in which an attacker tries to determine whether a given data point was used to train the model. Although diffusion models are intrinsically more resistant to membership inference attacks than other generative models, they are still susceptible. The defense proposed here uses critically-damped higher-order Langevin dynamics, which introduces several auxiliary variables and a joint diffusion process over these variables. The idea is that the auxiliary variables mix in external randomness that helps corrupt sensitive input data earlier in the diffusion process. This concept is investigated theoretically and validated on a toy dataset and a speech dataset, using the area under the receiver operating characteristic curve (AUROC) and the FID metric.
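
As a rough illustration of how AUROC quantifies membership leakage, the sketch below computes it from attack scores assigned to training members and to held-out non-members. The function name `auroc` and the example scores are hypothetical and are not taken from the paper or its released code; the attack that produces the scores is not modeled here.

```python
import numpy as np


def auroc(member_scores, nonmember_scores):
    """AUROC of a membership attack via pairwise comparison.

    Returns the probability that a randomly chosen member receives a higher
    attack score than a randomly chosen non-member, counting ties as 1/2.
    """
    members = np.asarray(member_scores, dtype=float)
    nonmembers = np.asarray(nonmember_scores, dtype=float)
    # Compare every member score against every non-member score.
    diff = members[:, None] - nonmembers[None, :]
    wins = (diff > 0).sum()
    ties = (diff == 0).sum()
    return float((wins + 0.5 * ties) / diff.size)


if __name__ == "__main__":
    # Hypothetical attack scores (higher = "more likely a training member").
    members = [0.9, 0.7, 0.4]
    nonmembers = [0.8, 0.3, 0.2]
    print(f"AUROC = {auroc(members, nonmembers):.3f}")
```

An AUROC near 0.5 means the attacker does no better than chance, while a value near 1.0 means members are almost perfectly separable from non-members. A successful defense should push the attack's AUROC toward 0.5.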
