Table of Contents
Fetching ...

Neural non-canonical Hamiltonian dynamics for long-time simulations

Clémentine Courtès, Emmanuel Franck, Michael Kraus, Laurent Navoret, Léopold Trémant

TL;DR

This work addresses learning dynamics of non-canonical Hamiltonian systems from data with the goal of reliable long-time simulations. It shows that preserving geometric structure through learning is insufficient by itself if the numerical scheme introduces gauge-dependent errors, and proposes two complementary strategies: vector-field learning with a gauge-aware regularization, and scheme learning that directly fits a discrete, structure-preserving flow via a Degenerate Variational Integrator. Through experiments on Lotka–Volterra, a massless charged particle, and a guiding-center model, the authors demonstrate how regularization mitigates instability in vector-field learning, while scheme-learning yields superior long-time accuracy when the training time-step and scheme are aligned. The results highlight the importance of harmonizing learned structure, invariants, and the chosen integrator to achieve stable, accurate long-time behavior in non-canonical Hamiltonian dynamics, with applications to plasma physics and related areas.

Abstract

This work focuses on learning non-canonical Hamiltonian dynamics from data, where long-term predictions require the preservation of structure both in the learned model and in numerical schemes. Previous research focused on either facet, respectively with a potential-based architecture and with degenerate variational integrators, but new issues arise when combining both. In experiments, the learnt model is sometimes numerically unstable due to the gauge dependency of the scheme, rendering long-time simulations impossible. In this paper, we identify this problem and propose two different training strategies to address it, either by directly learning the vector field or by learning a time-discrete dynamics through the scheme. Several numerical test cases assess the ability of the methods to learn complex physical dynamics, like the guiding center from gyrokinetic plasma physics.

Neural non-canonical Hamiltonian dynamics for long-time simulations

TL;DR

This work addresses learning dynamics of non-canonical Hamiltonian systems from data with the goal of reliable long-time simulations. It shows that preserving geometric structure through learning is insufficient by itself if the numerical scheme introduces gauge-dependent errors, and proposes two complementary strategies: vector-field learning with a gauge-aware regularization, and scheme learning that directly fits a discrete, structure-preserving flow via a Degenerate Variational Integrator. Through experiments on Lotka–Volterra, a massless charged particle, and a guiding-center model, the authors demonstrate how regularization mitigates instability in vector-field learning, while scheme-learning yields superior long-time accuracy when the training time-step and scheme are aligned. The results highlight the importance of harmonizing learned structure, invariants, and the chosen integrator to achieve stable, accurate long-time behavior in non-canonical Hamiltonian dynamics, with applications to plasma physics and related areas.

Abstract

This work focuses on learning non-canonical Hamiltonian dynamics from data, where long-term predictions require the preservation of structure both in the learned model and in numerical schemes. Previous research focused on either facet, respectively with a potential-based architecture and with degenerate variational integrators, but new issues arise when combining both. In experiments, the learnt model is sometimes numerically unstable due to the gauge dependency of the scheme, rendering long-time simulations impossible. In this paper, we identify this problem and propose two different training strategies to address it, either by directly learning the vector field or by learning a time-discrete dynamics through the scheme. Several numerical test cases assess the ability of the methods to learn complex physical dynamics, like the guiding center from gyrokinetic plasma physics.

Paper Structure

This paper contains 49 sections, 77 equations, 23 figures.

Figures (23)

  • Figure 1: (Lotka-Volterra) Long-time behaviour of the RK4 and DVI numerical schemes, with initial condition $x_0 = 1$, $y_0 = 1$ and time-step $h = 0.2$. Left: solutions in phase space. Only one out of 16 time-steps is displayed for the RK4 solution (to reduce file size), and only the first time-steps of the DVI solution is displayed (the solution is periodic). Right: relative error on the Hamiltonian as a function of time.
  • Figure 2: (Lotka-Volterra) Solutions with initial condition $x_0 = 1$, $y_0 = 1$ for the different models (reference model \ref{['eq:LotkaVolterraDiffEq']}, no structure neural model \ref{['eq:nostructuremodel']}, canonical neural model \ref{['eq:canonicalmodel']}, non-canonical neural model \ref{['eq:structNeuralNet']}), obtained using solve_ivp.
  • Figure 3: (Lotka-Volterra) Solutions with initial condition $x_0 = 1$, $y_0 = 1$ for the different models (reference model \ref{['eq:LotkaVolterraDiffEq']}, no structure neural model \ref{['eq:nostructuremodel']}, canonical neural model \ref{['eq:canonicalmodel']}, non-canonical neural model \ref{['eq:structNeuralNet']}), obtained using solve_ivp. Left: phase portrait. Right: time evolution of the relative error of the Hamiltonian \ref{['eq:LotkaVolterraLagrangian']} for the different solutions.
  • Figure 4: (Lotka-Volterra) Solutions with initial condition $x_0 = 4$, $y_0 = 3$ obtained using the first-order DVI scheme with different time steps for different models. Left: reference model \ref{['eq:LotkaVolterraLagrangian']}. Middle: a perturbed model $\vartheta(x, y) \leftarrow \vartheta(x, y) + \frac{1}{2} \cos(2x)$. Right: a non-canonical neural model \ref{['eq:structNeuralNet']}. Exact solutions refers to refined solutions obtained using solve_ivp on the same model.
  • Figure 5: (Lotka-Volterra) Numerical solutions of the reference model \ref{['eq:LotkaVolterraDiffEq']} with different initial data. Left: RK4 solver with time-step $h = 0.1$ and 250k steps. Right: DVI solver with time-step $h = 0.1$ and 100k steps. Points are displayed every 51 steps. Exact solutions are refined numerical solutions of the reference model.
  • ...and 18 more figures

Theorems & Definitions (4)

  • Remark 2.1
  • Remark 2.2
  • Remark 4.1
  • Remark A.1