Hamiltonian Generative Networks
Peter Toth, Danilo Jimenez Rezende, Andrew Jaegle, Sébastien Racanière, Aleksandar Botev, Irina Higgins
TL;DR
This work introduces the Hamiltonian Generative Network (HGN), a model that learns Hamiltonian dynamics directly from pixel observations and enables reversible, controllable rollouts as well as a density-modelling variant (NHF) based on Hamiltonian flows. The architecture combines an inference network, a Hamiltonian transition network, and a decoder to infer abstract phase-space states and simulate their evolution via Hamilton's equations with a symplectic leapfrog integrator. Empirical results on mass-spring, pendulum, and multi-body systems show that HGN significantly outperforms the Hamiltonian Neural Network baselines and supports robust, energy-conserving dynamics; NHF demonstrates expressive density modelling with interpretable potential landscapes and competitive efficiency. The approach bridges physics-inspired dynamics with deep generative modelling, offering practical benefits for long-horizon prediction, reversible inference, and multimodal density estimation in dynamical systems.
Abstract
The Hamiltonian formalism plays a central role in classical and quantum physics. Hamiltonians are the main tool for modelling the continuous time evolution of systems with conserved quantities, and they come equipped with many useful properties, like time reversibility and smooth interpolation in time. These properties are important for many machine learning problems - from sequence prediction to reinforcement learning and density modelling - but are not typically provided out of the box by standard tools such as recurrent neural networks. In this paper, we introduce the Hamiltonian Generative Network (HGN), the first approach capable of consistently learning Hamiltonian dynamics from high-dimensional observations (such as images) without restrictive domain assumptions. Once trained, we can use HGN to sample new trajectories, perform rollouts both forward and backward in time and even speed up or slow down the learned dynamics. We demonstrate how a simple modification of the network architecture turns HGN into a powerful normalising flow model, called Neural Hamiltonian Flow (NHF), that uses Hamiltonian dynamics to model expressive densities. We hope that our work serves as a first practical demonstration of the value that the Hamiltonian formalism can bring to deep learning.
