Formalized Hopfield Networks and Boltzmann Machines
Matteo Cipollina, Michail Karatarakis, Freek Wiedijk
TL;DR
This work formalizes both deterministic Hopfield networks and stochastic Boltzmann machines in Lean 4, establishing convergence, ergodicity, and learning properties within a unified graph-based framework. It develops a rigorous probabilistic foundation using Markov kernels, Gibbs sampling, and a novel formalization of the Perron–Frobenius theorem via a quiver-based irreducibility notion, enabling proofs of a unique stationary distribution for Boltzmann dynamics. Key contributions include formal proofs of Hopfield convergence under asynchronous updates, Hebbian learning for both single and orthogonal multi-pattern storage, and integration with PhysLean to connect neural dynamics to canonical ensembles. The approach emphasizes global dynamic properties of recurrent models over step-by-step computation, offering a robust, verifiable bridge between statistical mechanics, machine learning, and formal mathematics with clear paths for future MCMC and PF enhancements. This formalization advances the reliability and verifiability of neural-network theories by embedding them in a rigorous, library-wide Lean environment.
Abstract
Neural networks are widely used, yet their analysis and verification remain challenging. In this work, we present a Lean 4 formalization of neural networks, covering both deterministic and stochastic models. We first formalize Hopfield networks, recurrent networks that store patterns as stable states. We prove convergence and the correctness of Hebbian learning, a training rule that updates network parameters to encode patterns, here limited to the case of pairwise-orthogonal patterns. We then consider stochastic networks, where updates are probabilistic and convergence is to a stationary distribution. As a canonical example, we formalize the dynamics of Boltzmann machines and prove their ergodicity, showing convergence to a unique stationary distribution using a new formalization of the Perron-Frobenius theorem.
