CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

Pratik Jawahar; Maurizio Pierini

CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

Pratik Jawahar, Maurizio Pierini

TL;DR

It is shown that the CHLU is designed for infinite-horizon stability, as well as controllable noise filtering, and strictly conserves phase-space volume, as an attempt to solve the memory-stability trade-off.

Abstract

Current deep learning primitives dealing with temporal dynamics suffer from a fundamental dichotomy: they are either discrete and unstable (LSTMs) \citep{pascanu_difficulty_2013}, leading to exploding or vanishing gradients; or they are continuous and dissipative (Neural ODEs) \citep{dupont_augmented_2019}, which destroy information over time to ensure stability. We propose the \textbf{Causal Hamiltonian Learning Unit} (pronounced: \textit{clue}), a novel Physics-grounded computational learning primitive. By enforcing a Relativistic Hamiltonian structure and utilizing symplectic integration, a CHLU strictly conserves phase-space volume, as an attempt to solve the memory-stability trade-off. We show that the CHLU is designed for infinite-horizon stability, as well as controllable noise filtering. We then demonstrate a CHLU's generative ability using the MNIST dataset as a proof-of-principle.

CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

TL;DR

Abstract

Paper Structure (26 sections, 8 equations, 8 figures, 1 algorithm)

This paper contains 26 sections, 8 equations, 8 figures, 1 algorithm.

Introduction
Relativistic Kinetic Governor:
Action Propagation:
Methodology: The Geometry of a CHLU
The Separable Hamiltonian Engine
Relativistic Kinetic Governor
Symplectic Integration via Velocity Verlet
Training Dynamics: Hamiltonian Contrastive Divergence
Hamiltonian Annealing:
Generative Sampling via Langevin Dynamics:
Scope of Experiments and Results
Experiment I: Long-Horizon Stability (Lemniscate Tracing)
Experiment II: Kinetic Safety (The Perturbed Sine Wave)
Experiment III: Thermodynamic Generation
Conclusion: From Learning Dynamics to Enforcing Geometry
...and 11 more sections

Figures (8)

Figure 1: Lemniscate tracing experiment. The transparent lines show inference cycles $0-48$ and the solid lines show the final $2$ cycles, for LSTM, NODE and CHLU (left to right)
Figure 2: The LSTM, NODE and CHLU (left to right), each predicts the state $q$ given the same initial perturbed states, from which we calculate the KE and overlay it on the expected KE.
Figure 3: MNIST digits generated from the centroid of the test set with added noise.
Figure 4: The learned lemniscate potential energy plotted as $2D$, $3D$ heatmaps (top left, right) as well as a force field (bottom). The output trajectory is overlayed on all $3$ for visual comparison.
Figure 5: Kinetic energies calculated from the output state $q$ (top 3 rows), compared to the kinetic energies calculated from output state $p$ (bottom 3 rows) for the same $3$ sine waves with the same inital perturbed conditions for all models (left to right: LSTM, NODE, CHLU).
...and 3 more figures

CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

TL;DR

Abstract

CHLU: The Causal Hamiltonian Learning Unit as a Symplectic Primitive for Deep Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (8)