Quantum Geometry insights in Deep Learning
Noémie C. Combe
TL;DR
The paper argues that the Monge–Ampère equation and related optimal-transport principles provide a rigorous, geometric foundation for deep generative learning in Boltzmann machines and energy-based models. It shows that latent Gaussian mappings produce covariance structures that form a Wishart cone, which coincides with the Connes–Araki–Haagerup cone from quantum geometry, linking learning dynamics to modular flow concepts. It simultaneously offers an alternative Renormalization Group (RG) viewpoint, interpreting hidden layers as coarse-graining steps and highlighting the role of Monge–Ampère constraints in multi-scale representations. These dual perspectives unify statistical mechanics, quantum geometry, and deep learning theory, suggesting new optimization strategies and interpretable frameworks for hierarchical feature learning with potential practical impact on robust generative modeling.
Abstract
In this paper, we explore the fundamental role of the Monge-Ampère equation in deep learning, particularly in the context of Boltzmann machines and energy-based models. We first review the structure of Boltzmann learning and its relation to free energy minimization. We then establish a connection between optimal transport theory and deep learning, demonstrating how the Monge-Ampère equation governs probability transformations in generative models. Additionally, we provide insights from quantum geometry, showing that the space of covariance matrices arising in the learning process coincides with the Connes-Araki-Haagerup (CAH) cone in von Neumann algebra theory. Furthermore, we introduce an alternative approach based on renormalization group (RG) flow, which, while distinct from the optimal transport perspective, reveals another manifestation of the Monge-Ampère domain in learning dynamics. This dual perspective offers a deeper mathematical understanding of hierarchical feature learning, bridging concepts from statistical mechanics, quantum geometry, and deep learning theory.
