Loss Barcode: A Topological Measure of Escapability in Loss Landscapes

Serguei Barannikov; Daria Voronkova; Alexander Mironenko; Ilya Trofimov; Alexander Korotin; Grigorii Sotnikov; Evgeny Burnaev

TL;DR

This paper uses the topology of the loss function to relate the local behavior of gradient descent trajectories to the global properties of the loss surface. It defines a neural network's Topological Obstructions score (TO-score) using barcodes of the loss function, robust topological invariants that quantify how escapable local minima are for gradient-based optimization.

Abstract

Neural network training is commonly based on SGD. However, the understanding of SGD's ability to converge to good local minima, given the non-convex nature of loss functions and the intricate geometric characteristics of loss landscapes, remains limited. In this paper, we apply topological data analysis methods to loss landscapes to gain insights into the learning process and generalization properties of deep neural networks. We use the loss function topology to relate the local behavior of gradient descent trajectories to the global properties of the loss surface. For this purpose, we define the neural network's Topological Obstructions score ("TO-score") with the help of barcodes of the loss function, robust topological invariants that quantify the escapability of local minima for gradient-based optimization. Our two principal observations are: 1) the loss barcode of the neural network decreases with increasing depth and width, and therefore the topological obstructions to learning diminish; 2) in certain situations there is a connection between the length of minima segments in the loss barcode and the minima's generalization errors. These observations are based on extensive experiments with fully connected, convolutional, and transformer architectures and several datasets, including MNIST, FMNIST, CIFAR10, CIFAR100, SVHN, and the multilingual OSCAR text dataset.
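
To make the idea of a barcode of minima concrete, the following is a minimal sketch, assuming a toy one-dimensional loss sampled on a grid. It computes the standard 0-dimensional sublevel-set persistence: each local minimum is born at its own value and dies at the lowest level at which its basin merges with the basin of a lower minimum. The function name `minima_barcode` and the sample values are hypothetical illustrations. This is not the authors' implementation, and it does not compute the TO-score, whose definition is not given in this abstract.

```python
import math


def minima_barcode(values):
    """Return (birth, death) bars for the local minima of a 1D sampled function.

    Illustrative sketch only: sublevel-set persistence in dimension 0 on a
    path graph, using union-find and the elder rule.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])  # sweep levels upward
    parent = [None] * n          # None means "not yet in the sublevel set"
    lowest = [math.inf] * n      # lowest[root] = minimum value of that basin
    bars = []

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in order:
        parent[i] = i
        lowest[i] = values[i]
        for j in (i - 1, i + 1):
            if 0 <= j < n and parent[j] is not None:
                ri, rj = find(i), find(j)
                if ri == rj:
                    continue
                # Elder rule: the basin with the higher minimum dies here.
                young, old = (ri, rj) if lowest[ri] > lowest[rj] else (rj, ri)
                if lowest[young] < values[i]:  # skip zero-length bars
                    bars.append((lowest[young], values[i]))
                parent[young] = old

    # The global minimum is never merged into a lower basin.
    bars.append((values[order[0]], math.inf))
    return sorted(bars)


toy_loss = [0.0, 2.0, 1.0, 3.0, 0.5, 4.0]  # hypothetical sampled loss values
print(minima_barcode(toy_loss))
```

For this toy input the bars are (0.0, inf), (0.5, 3.0), and (1.0, 2.0). The length of a bar is the height that must be climbed from a local minimum before a path to a lower minimum opens up, so longer bars correspond to minima that are harder to escape.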
