A Geometric Framework for Understanding Memorization in Generative Models

Brendan Leigh Ross; Hamidreza Kamkari; Tongzi Wu; Rasa Hosseinzadeh; Zhaoyan Liu; George Stein; Jesse C. Cresswell; Gabriel Loaiza-Ganem

TL;DR

The paper proposes the manifold memorization hypothesis (MMH), a geometric framework that explains memorization in deep generative models by comparing the local intrinsic dimension (LID) of the ground-truth data manifold $\mathcal{M}_*$ with that of the model manifold $\mathcal{M}_\theta$. Under the MMH, a point $x \in \mathcal{M}_\theta$ is memorized when $\text{LID}_\theta(x)$ is low. Memorization is categorized as overfitting-driven (OD-Mem), where $\text{LID}_\theta(x) < \text{LID}_*(x)$, or data-driven (DD-Mem), where $\text{LID}_*(x)$ is itself low, so the low dimensionality comes from the data distribution rather than from overfitting. The authors validate the MMH on toy data and real-world image models, develop and compare practical LID estimators (FLIPD, NB, Local PCA), and demonstrate mitigation strategies that increase LID during sampling, including CFG-based and token-attribution approaches. They also connect the MMH to existing literature, offering a unified explanation for duplication-induced, reconstructive, and conditioning-induced memorization, while acknowledging the limitations of current LID estimators and the need for more robust ones. Overall, the MMH provides diagnostics and mitigations for memorization that bear on privacy, copyright risk, and the safe deployment of generative systems: guiding generated samples toward higher LID can reduce memorized outputs in large-scale diffusion models.

Abstract

As deep generative models have progressed, recent work has shown them to be capable of memorizing and reproducing training datapoints when deployed. These findings call into question the usability of generative models, especially in light of the legal and privacy risks brought about by memorization. To better understand this phenomenon, we propose the manifold memorization hypothesis (MMH), a geometric framework which leverages the manifold hypothesis into a clear language in which to reason about memorization. We propose to analyze memorization in terms of the relationship between the dimensionalities of (i) the ground truth data manifold and (ii) the manifold learned by the model. This framework provides a formal standard for "how memorized" a datapoint is and systematically categorizes memorized data into two types: memorization driven by overfitting and memorization driven by the underlying data distribution. By analyzing prior work in the context of the MMH, we explain and unify assorted observations in the literature. We empirically validate the MMH using synthetic data and image datasets up to the scale of Stable Diffusion, developing new tools for detecting and preventing generation of memorized samples in the process.
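
To make the comparison of the two dimensionalities concrete, the sketch below estimates LID with a simple local PCA estimator on toy data. It is an illustrative assumption rather than the authors' implementation: the function name `local_pca_lid`, the neighbourhood size `k`, the variance threshold, and the toy manifolds (a 2D plane standing in for the ground-truth manifold and a 1D line standing in for a collapsed model manifold) are all hypothetical choices made for this example.

```python
import numpy as np


def local_pca_lid(data, query, k=50, var_threshold=0.95):
    """Estimate LID at `query` as the number of principal components needed
    to explain `var_threshold` of the variance among its k nearest neighbours.
    (Hypothetical helper; k and var_threshold are illustrative defaults.)"""
    dists = np.linalg.norm(data - query, axis=1)
    neighbours = data[np.argsort(dists)[:k]]
    centred = neighbours - neighbours.mean(axis=0)
    # Squared singular values are proportional to the variance along each
    # principal direction of the neighbourhood.
    variances = np.linalg.svd(centred, compute_uv=False) ** 2
    ratios = np.cumsum(variances) / variances.sum()
    # First index whose cumulative ratio reaches the threshold, as a count.
    return int(np.searchsorted(ratios, var_threshold) + 1)


rng = np.random.default_rng(0)
n = 2000
zeros = np.zeros(n)

# Toy stand-in for the ground-truth manifold: a 2D plane embedded in R^3.
data_true = np.column_stack(
    [rng.uniform(-1, 1, n), rng.uniform(-1, 1, n), zeros]
)
# Toy stand-in for a model manifold that has collapsed onto a 1D line.
data_model = np.column_stack([rng.uniform(-1, 1, n), zeros, zeros])

query = np.zeros(3)  # a point lying on both toy manifolds
lid_true = local_pca_lid(data_true, query)
lid_model = local_pca_lid(data_model, query)

print("LID on ground-truth manifold:", lid_true)
print("LID on model manifold:", lid_model)
print("Model LID below ground-truth LID:", lid_model < lid_true)
```

In this toy setting the model manifold has lower local dimension than the ground-truth manifold at the query point, which is the situation the framework associates with memorization driven by overfitting. Real image data would need a more robust estimator and careful choice of neighbourhood size.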
