Learning advanced mathematical computations from examples
François Charton, Amaury Hayat, Guillaume Lample
TL;DR
The paper demonstrates that transformers trained on expansive synthetic datasets can predict both qualitative and numerical properties of advanced mathematical problems without embedding mathematical knowledge. By targeting local stability, controllability, and PDE behavior, it shows near-perfect qualitative accuracy and strong numerical performance, often surpassing simple baselines and generalizing to longer expressions and new problem distributions. The work highlights the potential of neural sequence models to learn symbolic-numeric computation, while acknowledging that learned solutions may rely on shortcuts rather than explicit mathematical reasoning. These findings open avenues for fast, parallelizable alternatives to traditional solvers and motivate further investigation into interpretability and the limits of such learned mathematical reasoning.
Abstract
Using transformers over large generated datasets, we train models to learn mathematical properties of differential systems, such as local stability, behavior at infinity and controllability. We achieve near perfect prediction of qualitative characteristics, and good approximations of numerical features of the system. This demonstrates that neural networks can learn to perform complex computations, grounded in advanced theory, from examples, without built-in mathematical knowledge.
