MetaSym: A Symplectic Meta-learning Framework for Physical Intelligence
Pranav Vaidhyanathan, Aristotelis Papatheodorou, Mark T. Mitchison, Natalia Ares, Ioannis Havoutis
TL;DR
MetaSym addresses the challenge of scalable physics-aware dynamics modeling by embedding a symplectic prior into a neural encoder and coupling it with a meta-learned autoregressive decoder. The SymplecticEncoder preserves the symplectic form $d\text{Φ}_{\theta}(\mathbf{x})^{\top} J\, d\text{Φ}_{\theta}(\mathbf{x}) = J$ and, with time-reversal training, mitigates energy drift, while the ActiveDecoder uses meta-attention to adapt to nonconservative forces and system variations. The framework is validated on three domains—high-dimensional spring meshes, open quantum systems, and quadrotor dynamics—showing superior few-shot adaptation and long-horizon accuracy using a model smaller than baselines. These results suggest that combining structure-preserving priors with targeted meta-learning yields robust, data-efficient physical predictions suitable for real-time decision-making and control.
Abstract
Scalable and generalizable physics-aware deep learning has long been considered a significant challenge with various applications across diverse domains ranging from robotics to molecular dynamics. Central to almost all physical systems are symplectic forms, the geometric backbone that underpins fundamental invariants like energy and momentum. In this work, we introduce a novel deep learning framework, MetaSym. In particular, MetaSym combines a strong symplectic inductive bias obtained from a symplectic encoder, and an autoregressive decoder with meta-attention. This principled design ensures that core physical invariants remain intact, while allowing flexible, data-efficient adaptation to system heterogeneities. We benchmark MetaSym with highly varied and realistic datasets, such as a high-dimensional spring-mesh system (Otness et al., 2021), an open quantum system with dissipation and measurement backaction, and robotics-inspired quadrotor dynamics. Our results demonstrate superior performance in modeling dynamics under few-shot adaptation, outperforming state-of-the-art baselines that use larger models.
