Rapid training of Hamiltonian graph networks using random features

Atamert Rahma; Chinmay Datar; Ana Cukarska; Felix Dietrich

TL;DR

The paper tackles the slow, gradient-based training of physics-informed Hamiltonian graph networks for N-body dynamics. It introduces Random Feature Hamiltonian Graph Networks (RF-HGN), which replace iterative optimization with random feature-based dense layers and a linear least-squares readout, so that training requires no gradient descent. By enforcing translation, rotation, and permutation invariance, RF-HGN generalizes zero-shot from small graphs to very large ones while maintaining energy-consistent dynamics. Across mass-spring, Lennard-Jones, and molecular dynamics benchmarks in up to three dimensions with thousands of particles, the method achieves accuracy comparable to state-of-the-art models with dramatically shorter training times, challenging the dominance of gradient-based optimization in physics-informed learning.
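The idea of a randomly sampled dense layer followed by a linear least-squares readout can be illustrated with a generic random feature regression. The following is a minimal sketch under stated assumptions, not the RF-HGN implementation: the function names, the Gaussian and uniform sampling distributions, the `tanh` activation, the layer width, and the toy quadratic target are all chosen here for illustration, and the target is fitted directly from function values.

```python
import numpy as np

def fit_random_features(x, y, width=200, seed=0, rcond=1e-10):
    """Sample hidden parameters once, then solve only for the linear readout."""
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(x.shape[1], width))   # hidden weights: sampled, never trained (assumed distribution)
    b = rng.uniform(-1.0, 1.0, size=width)     # hidden biases: sampled, never trained (assumed distribution)
    features = np.tanh(x @ W + b)              # random feature matrix, shape (samples, width)
    # The only fitted parameters are the readout coefficients c,
    # obtained from a single linear least-squares solve.
    c, *_ = np.linalg.lstsq(features, y, rcond=rcond)
    return W, b, c

def predict(x, W, b, c):
    """Evaluate the random feature model at new inputs."""
    return np.tanh(x @ W + b) @ c

# Toy target (an assumption for illustration): a quadratic spring-like energy.
x_train = np.linspace(-1.0, 1.0, 200).reshape(-1, 1)
y_train = 0.5 * x_train[:, 0] ** 2

W, b, c = fit_random_features(x_train, y_train)
train_error = np.max(np.abs(predict(x_train, W, b, c) - y_train))
print(f"max training error: {train_error:.2e}")
```

No optimizer loop appears anywhere in the sketch: the cost of fitting is one matrix construction and one least-squares solve, which is the property the TL;DR credits for the fast training.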

Abstract

Learning dynamical systems that respect physical symmetries and constraints remains a fundamental challenge in data-driven modeling. Integrating physical laws with graph neural networks facilitates principled modeling of complex N-body dynamics and yields accurate and permutation-invariant models. However, training graph neural networks with iterative, gradient-based optimization algorithms (e.g., Adam, RMSProp, LBFGS) is often slow, especially for large, complex systems. In a comparison against 15 different optimizers, we demonstrate that Hamiltonian Graph Networks (HGN) can be trained up to 600x faster, with comparable accuracy, by replacing iterative optimization with random feature-based parameter construction. We show robust performance in diverse simulations, including N-body mass-spring and molecular systems in up to 3 dimensions and with up to 10,000 particles in different geometries, while retaining essential physical invariances with respect to permutation, rotation, and translation. Our proposed approach is benchmarked using a NeurIPS 2022 Datasets and Benchmarks Track publication to further demonstrate its versatility. We reveal that even when trained on minimal 8-node systems, the model can generalize in a zero-shot manner to systems as large as 4096 nodes without retraining. Our work challenges the dominance of iterative gradient-descent-based optimization algorithms for training neural network models for physical systems.
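The three invariances named in the abstract can be checked numerically on a toy scalar built only from edge lengths and summed over edges. This is a hedged sketch of one common way to obtain such invariances, not the encoding used by the authors: the function `invariant_scalar`, the ring graph, the unit rest length, and the quadratic edge term are assumptions made for illustration.

```python
import numpy as np

def invariant_scalar(q, edges):
    """Sum a function of edge lengths; the result depends only on relative distances."""
    src, dst = edges[:, 0], edges[:, 1]
    lengths = np.linalg.norm(q[src] - q[dst], axis=1)
    # Assumed toy edge term: quadratic spring with unit rest length.
    return np.sum(0.5 * (lengths - 1.0) ** 2)

rng = np.random.default_rng(0)
q = rng.normal(size=(8, 2))                              # 8 nodes in 2D
edges = np.array([[i, (i + 1) % 8] for i in range(8)])   # ring of 8 nodes

base = invariant_scalar(q, edges)

# Translation: shift every node by the same vector.
shifted = invariant_scalar(q + np.array([3.0, -1.5]), edges)

# Rotation: apply the same rotation matrix to every node position.
theta = 0.7
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
rotated = invariant_scalar(q @ R.T, edges)

# Permutation: relabel the nodes and relabel the edge list consistently.
perm = rng.permutation(8)
inverse = np.argsort(perm)            # maps an old node index to its new index
permuted = invariant_scalar(q[perm], inverse[edges])

print(np.isclose(base, shifted), np.isclose(base, rotated), np.isclose(base, permuted))
```

Because the scalar is a sum of per-edge terms that do not depend on the number of nodes, the same function can be evaluated on a graph of any size, which gives some intuition for why a model with this structure can be applied to larger systems than it was fitted on.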
