Rates and architectures for learning geometrically non-trivial operators
T. Mitchell Roddenberry, Leo Tzou, Ivan Dokmanić, Maarten V. de Hoop, Richard G. Baraniuk
TL;DR
This work extends operator learning to geometrically non-trivial mappings by introducing double fibration transforms, such as Radon and geodesic ray transforms, and proving they are not cursed by dimensionality when learned from random, bandlimited samples. It develops a levelset integral kernel parameterization that implicitly encodes the incidence relation and amplitude, with a cross-attention–style factorization that achieves universality and stability under discretization. Theoretical results show superalgebraic learning rates under Bolker-type geometry and smoothing-induced bandwidth control, while practical methods enable accurate learning from very few examples. Empirically, the approach excels at learning Radon and ray transforms, identifies codimension through Jacobian analysis, and extends to Riemannian inverse problems, including wavespeed recovery, demonstrating meaningful gains in data efficiency and interpretability for geometric inverse problems.
Abstract
Deep learning methods have proven capable of recovering operators between high-dimensional spaces, such as solution maps of PDEs and similar objects in mathematical physics, from very few training samples. This phenomenon of data-efficiency has been proven for certain classes of elliptic operators with simple geometry, i.e., operators that do not change the domain of the function or propagate singularities. However, scientific machine learning is commonly used for problems that do involve the propagation of singularities in a priori unknown ways, such as waves, advection, and fluid dynamics. In light of this, we expand the learning theory to include double fibration transforms--geometric integral operators that include generalized Radon and geodesic ray transforms. We prove that this class of operators does not suffer from the curse of dimensionality: the error decays superalgebraically, that is, faster than any fixed power of the reciprocal of the number of training samples. Furthermore, we investigate architectures that explicitly encode the geometry of these transforms, demonstrating that an architecture reminiscent of cross-attention based on levelset methods yields a parameterization that is universal, stable, and learns double fibration transforms from very few training examples. Our results contribute to a rapidly-growing line of theoretical work on learning operators for scientific machine learning.
