Extrapolation and learning equations

Georg Martius; Christoph H. Lampert

Extrapolation and learning equations

Georg Martius, Christoph H. Lampert

TL;DR

The paper tackles extrapolation in regression for physical systems by introducing the Equation Learner (EQL), a differentiable network that learns analytic expressions using a structured mix of unary and pairwise multiplication units. EQL is trained with a staged sparsity strategy and a model-selection criterion designed to favor simple, extrapolable formulas, enabling interpretable equation discovery. Through experiments on pendulum dynamics, double pendulum kinematics, planar robotic arms, synthetic formula learning, and X-ray transition energies, EQL demonstrates strong extrapolation capability and the ability to recover or approximate underlying physical laws, while highlighting limitations when the true relation lies outside the hypothesized function set. The work provides a pathway toward physics-informed, interpretable models with generalization beyond training domains, and points to future extensions of the base function set to broaden applicability.

Abstract

In classical machine learning, regression is treated as a black box process of identifying a suitable function from a hypothesis set without attempting to gain insight into the mechanism connecting inputs and outputs. In the natural sciences, however, finding an interpretable function for a phenomenon is the prime goal as it allows to understand and generalize results. This paper proposes a novel type of function learning network, called equation learner (EQL), that can learn analytical expressions and is able to extrapolate to unseen domains. It is implemented as an end-to-end differentiable feed-forward network and allows for efficient gradient based training. Due to sparsity regularization concise interpretable expressions can be obtained. Often the true underlying source expression is identified.

Extrapolation and learning equations

TL;DR

Abstract

Extrapolation and learning equations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)