Table of Contents
Fetching ...

Statistical physics for artificial neural networks

Zongrui Pei

TL;DR

The paper reviews the connections between spin-glass physics and artificial neural networks, emphasizing how energy landscapes, metastability, and mean-field methods illuminate ANN dynamics, memory, and training. It surveys Hopfield and Boltzmann machines, replica theory and the cavity method, and the implications of overparameterization and double descent, while outlining challenges in extending these ideas to deep networks. It argues for the development of new order parameters and the potential of quantum computing to address rugged loss landscapes and scaling limits. The work highlights the bidirectional benefits of this interdisciplinary approach for both fundamental physics and practical machine learning, and sketches future opportunities in neural computing hardware and quantum–neural paradigms.

Abstract

The 2024 Nobel Prize in Physics was awarded for pioneering contributions at the intersection of artificial neural networks (ANNs) and spin-glass physics, underscoring the profound connections between these fields. The topological similarities between ANNs and Ising-type models, such as the Sherrington-Kirkpatrick model, reveal shared structures that bridge statistical physics and machine learning. In this perspective, we explore how concepts and methods from statistical physics, particularly those related to glassy and disordered systems like spin glasses, are applied to the study and development of ANNs. We discuss the key differences, common features, and deep interconnections between spin glasses and neural networks while highlighting future directions for this interdisciplinary research. Special attention is given to the synergy between spin-glass studies and neural network advancements and the challenges that remain in statistical physics for ANNs. Finally, we examine the transformative role that quantum computing could play in addressing these challenges and propelling this research frontier forward.

Statistical physics for artificial neural networks

TL;DR

The paper reviews the connections between spin-glass physics and artificial neural networks, emphasizing how energy landscapes, metastability, and mean-field methods illuminate ANN dynamics, memory, and training. It surveys Hopfield and Boltzmann machines, replica theory and the cavity method, and the implications of overparameterization and double descent, while outlining challenges in extending these ideas to deep networks. It argues for the development of new order parameters and the potential of quantum computing to address rugged loss landscapes and scaling limits. The work highlights the bidirectional benefits of this interdisciplinary approach for both fundamental physics and practical machine learning, and sketches future opportunities in neural computing hardware and quantum–neural paradigms.

Abstract

The 2024 Nobel Prize in Physics was awarded for pioneering contributions at the intersection of artificial neural networks (ANNs) and spin-glass physics, underscoring the profound connections between these fields. The topological similarities between ANNs and Ising-type models, such as the Sherrington-Kirkpatrick model, reveal shared structures that bridge statistical physics and machine learning. In this perspective, we explore how concepts and methods from statistical physics, particularly those related to glassy and disordered systems like spin glasses, are applied to the study and development of ANNs. We discuss the key differences, common features, and deep interconnections between spin glasses and neural networks while highlighting future directions for this interdisciplinary research. Special attention is given to the synergy between spin-glass studies and neural network advancements and the challenges that remain in statistical physics for ANNs. Finally, we examine the transformative role that quantum computing could play in addressing these challenges and propelling this research frontier forward.

Paper Structure

This paper contains 14 sections, 7 equations, 7 figures, 1 table.

Figures (7)

  • Figure 1: The applications of artificial neural networks in physics. Artificial neural networks have been widely utilized to solve a diverse range of problems, including those in scientific research. Here, we show a few typical examples in physical research, i.e., (i) many-body interactions (machine-learning potentials) unke2019physnetzhang2018deep, (ii) thermodynamics (proposal of order parameters to describe phase transitions) yin2021neuralrogal2019neural, (iii) discovery of physical laws and concepts iten2020discovering, (iv) discovery or design of new materials pei2021machine, and (v) language models for materials design pei2023towardTshitoyan2019pei2024towards.
  • Figure 2: The history of artificial neural networks and relevant physical models. Other landmarks include backpropagation in the 1970s and AlexNet in 2012. Interestingly, bio-inspired ANNs ultimately led to the solution of the protein folding problem that has plagued us for half a century.
  • Figure 3: Basic concepts in statistical physics and thermodynamics. Here, we show that each pattern (memory or configuration) corresponds to an energy state in the rugged energy landscape of the free energy function or an error function.
  • Figure 4: Topology of an artificial neural network and Sherrington-Kirkpatrick (SK) model. a, A simple, fully connected neural network with one hidden layer. b, A four-spin mean field SK spin-glass model.
  • Figure 5: Biological neurons, artificial neural networks, and action potentials. a, Biological neuron and its associated concepts duan2020spiking. b, Artificial neural network with input features $V_{\mathrm{in},i}$ and output features $V_{\mathrm{out},i}$duan2020spiking. Here, the resistor symbols (rectangles) represent the activation functions that switch the signals from individual inputs on or off. c Activation potentials naundorf2006unique. The top panel represents an action potential in a cat visual cortex neuron in vivo. The middle panel is an action potential from a cat visual cortical slice in vivo at 20°C. The bottom panel is a model potential. The arrow indicates the characteristic kink at the onset of the action potential. This figure is adapted from Refs. duan2020spikingnaundorf2006unique and modified.
  • ...and 2 more figures