Quantifying Manifolds: Do the manifolds learned by Generative Adversarial Networks converge to the real data manifold

Anupam Chaudhuri; Anj Simmons; Mohamed Abdelrazek

Quantifying Manifolds: Do the manifolds learned by Generative Adversarial Networks converge to the real data manifold

Anupam Chaudhuri, Anj Simmons, Mohamed Abdelrazek

TL;DR

The paper addresses how GANs learn data manifolds and whether these learned manifolds converge to the real data manifold. It adopts Topological Data Analysis (TDA) by leveraging persistence diagrams and Vietoris–Rips filtrations to quantify intrinsic dimensions and topological features ($H_0$, $H_1$, $H_2$) of both real and generated data across training epochs, using entropy and Wasserstein distances as comparators. Experiments on CIFAR-10 cat images show that the GAN-generated manifold's intrinsic dimension and topological summaries tend to converge toward the real data manifold over training, with $H_0$ converging faster than higher homology groups. The work presents a multi-metric framework for GAN manifold quantification, offering a principled lens on robustness and data geometry and suggesting broader applicability to other generative models.

Abstract

This paper presents our experiments to quantify the manifolds learned by ML models (in our experiment, we use a GAN model) as they train. We compare the manifolds learned at each epoch to the real manifolds representing the real data. To quantify a manifold, we study the intrinsic dimensions and topological features of the manifold learned by the ML model, how these metrics change as we continue to train the model, and whether these metrics convergence over the course of training to the metrics of the real data manifold.

Quantifying Manifolds: Do the manifolds learned by Generative Adversarial Networks converge to the real data manifold

TL;DR

) of both real and generated data across training epochs, using entropy and Wasserstein distances as comparators. Experiments on CIFAR-10 cat images show that the GAN-generated manifold's intrinsic dimension and topological summaries tend to converge toward the real data manifold over training, with

converging faster than higher homology groups. The work presents a multi-metric framework for GAN manifold quantification, offering a principled lens on robustness and data geometry and suggesting broader applicability to other generative models.

Abstract

Paper Structure (14 sections, 22 equations, 2 figures)

This paper contains 14 sections, 22 equations, 2 figures.

Introduction
Topological Data analysis
Simplicial complex and Simplicial Chain
Persistence Diagram
Categorial framework to understand the quantification
Manifold Learning
Manifold Quantification Metrics
Intrinsic dimensions
Topological Features
Gaps
Experiment
Dataset and Model
Results
Conclusion

Figures (2)

Figure 1: Intrinsic Dimension vs Epoch
Figure 2: Topological Features vs Epoch

Theorems & Definitions (4)

definition 1
definition 2
definition 3: Manifold
definition 4

Quantifying Manifolds: Do the manifolds learned by Generative Adversarial Networks converge to the real data manifold

TL;DR

Abstract

Quantifying Manifolds: Do the manifolds learned by Generative Adversarial Networks converge to the real data manifold

Authors

TL;DR

Abstract

Table of Contents

Figures (2)

Theorems & Definitions (4)