Topological Data Analysis for Neural Network Analysis: A Comprehensive Survey
Rubén Ballester, Carles Casacuberta, Sergio Escalera
TL;DR
Topological Data Analysis (TDA) is applied to neural networks to uncover geometric and topological structure in architectures, inputs, activations, and training dynamics using persistent homology, Mapper, and GTDA. The survey compiles findings across four analysis domains and highlights correlations between topological features such as $b_n$ and persistence diagrams with generalization, robustness, and generative model quality. It also reviews practical applications including regularization, pruning, adversarial and Trojan detection, model selection, and accuracy prediction, while noting computational challenges. The authors discuss future directions such as $persistent\ path\ topology$ for directed graphs and $multiparameter\ persistence$ to capture joint filtrations, aiming to extend TDA's applicability to modern architectures. Overall, the work offers a synthesis of how topology informs understanding and design of neural networks, and points to theory and scalable algorithms as key future needs.
Abstract
This survey provides a comprehensive exploration of applications of Topological Data Analysis (TDA) within neural network analysis. Using TDA tools such as persistent homology and Mapper, we delve into the intricate structures and behaviors of neural networks and their datasets. We discuss different strategies to obtain topological information from data and neural networks by means of TDA. Additionally, we review how topological information can be leveraged to analyze properties of neural networks, such as their generalization capacity or expressivity. We explore practical implications of deep learning, specifically focusing on areas like adversarial detection and model selection. Our survey organizes the examined works into four broad domains: 1. Characterization of neural network architectures; 2. Analysis of decision regions and boundaries; 3. Study of internal representations, activations, and parameters; 4. Exploration of training dynamics and loss functions. Within each category, we discuss several articles, offering background information to aid in understanding the various methodologies. We conclude with a synthesis of key insights gained from our study, accompanied by a discussion of challenges and potential advancements in the field.
