Landscaper: Understanding Loss Landscapes Through Multi-Dimensional Topological Analysis
Jiaqing Chen, Nicholas Hadler, Tiankai Xie, Rostyslav Hnatyshyn, Caleb Geniesse, Yaoqing Yang, Michael W. Mahoney, Talita Perciano, John F. Hartwig, Ross Maciejewski, Gunther H. Weber
TL;DR
This work tackles understanding neural network optimization and generalization through high-dimensional loss landscapes, where traditional low-dimensional analyses fall short. It introduces Landscaper, an open-source framework that combines Hessian-based subspace analysis with topological data analysis and introduces $SMAD$ as a global landscape smoothness metric. Across CNNs, Transformers, GNNs, and SciML tasks, Landscaper reveals training dynamics and OOD generalization patterns that curvature-based metrics miss, demonstrating the value of global topological insights for model diagnostics. The approach offers practical impact for architecture design in data-scarce settings and outlines clear paths for scalability and broader validation in future work.
Abstract
Loss landscapes are a powerful tool for understanding neural network optimization and generalization, yet traditional low-dimensional analyses often miss complex topological features. We present Landscaper, an open-source Python package for arbitrary-dimensional loss landscape analysis. Landscaper combines Hessian-based subspace construction with topological data analysis to reveal geometric structures such as basin hierarchy and connectivity. A key component is the Saddle-Minimum Average Distance (SMAD) for quantifying landscape smoothness. We demonstrate Landscaper's effectiveness across various architectures and tasks, including those involving pre-trained language models, showing that SMAD captures training transitions, such as landscape simplification, that conventional metrics miss. We also illustrate Landscaper's performance in challenging chemical property prediction tasks, where SMAD can serve as a metric for out-of-distribution generalization, offering valuable insights for model diagnostics and architecture design in data-scarce scientific machine learning scenarios.
