Neural Network Perturbation Theory (NNPT): Learning Residual Corrections from Exact Solutions
Zhenhao Chen, Mutian Shen, Boris Fain, Zohar Nussinov
TL;DR
This work presents Neural Network Perturbation Theory (NNPT), a strategy to learn only residual perturbations after subtracting analytically solvable baselines, applied to the planar circular restricted three-body problem to probe how dynamical complexity governs neural capacity. By constraining the network to model Jupiter’s perturbation on top of the Keplerian baseline, the authors achieve substantial parameter efficiency and demonstrate a sharp capacity transition near chaos onset that scales with the resonance-overlap criterion and precedes geometric chaos signatures. The study also employs autoencoder analyses to quantify intrinsic dimensionality, revealing a torus-dominated structure in integrable regimes that collapses into higher-dimensional chaotic attractors as dynamics become chaotic, though decoder overhead can temper latent-space benefits for trajectory prediction. Collectively, NNPT provides a general framework for building physics-informed, capacity-aware surrogates and highlights fundamental limits on fixed-architecture networks in chaotic regimes, with implications across quantum, fluid, and plasma systems.
Abstract
Many complex physical systems admit natural decomposition into an exactly solvable component and a perturbative correction. Rather than training neural networks to learn complete trajectories from scratch, we introduce Neural Network Perturbation Theory (NNPT), where networks predict only residual perturbations after analytically subtracting known exact solutions. We validate this framework through systematic comparison: using identical 2x32 architectures, correction learning achieves 28-54x lower validation error compared to networks trained on complete trajectories. Using the gravitational three-body problem as a test bed, we investigate capacity transitions in fixed-architecture multilayer perceptrons as Jovian mass varies from 0.05 to 30 times its physical value. An equalized-accuracy protocol reveals that both minimal network capacity and training time exhibit sharp transitions at f_c = 15.6+-1.0, where the system enters a strongly chaotic regime. At this transition, minimal capacity jumps approximately sevenfold from ~1,200 to ~8,600 parameters (architectures 2x32 and 3x64). Preliminary exploration of sequential two-stage corrections suggests that first-stage networks already capture dominant perturbative features. Our symplectic integrator maintains relative energy conservation below 2x10^-7 throughout, confirming that transitions reflect physical complexity rather than numerical error. Our results establish correction learning as a general strategy for parameter-efficient surrogates and demonstrate that physical complexity imposes fundamental capacity barriers on fixed-architecture networks at chaos onset.
