Generative Neural Operators through Diffusion Last Layer
Sungwon Park, Anthony Zhou, Hongjoong Kim, Amir Barati Farimani
TL;DR
This work addresses uncertainty quantification for neural operators by introducing the diffusion last layer (DLL), a lightweight head that upgrades any deterministic neural-operator backbone into a conditional generative surrogate. DLL learns an input-conditioned, low-rank KL representation of outputs and trains a diffusion model in coefficient space to sample from $p(x|a)$, preserving discretization invariance and enabling efficient uncertainty modeling. Across stochastic PDE benchmarks, DLL improves both distributional fidelity and calibration, while also enhancing long-horizon rollout stability in deterministic chaotic systems. The approach combines an operator encoder with a compatibility diffusion head, offering a practical, scalable path to uncertainty-aware operator learning with broad applicability to inverse problems and irregular geometries.
Abstract
Neural operators have emerged as a powerful paradigm for learning discretization-invariant function-to-function mappings in scientific computing. However, many practical systems are inherently stochastic, making principled uncertainty quantification essential for reliable deployment. To address this, we introduce a simple add-on, the diffusion last layer (DLL), a lightweight probabilistic head that can be attached to arbitrary neural operator backbones to model predictive uncertainty. Motivated by the relative smoothness and low-dimensional structure often exhibited by PDE solution distributions, DLL parameterizes the conditional output distribution directly in function space through a low-rank Karhunen-Loève expansion, enabling efficient and expressive uncertainty modeling. Across stochastic PDE operator learning benchmarks, DLL improves generalization and uncertainty-aware prediction. Moreover, even in deterministic long-horizon rollout settings, DLL enhances rollout stability and provides meaningful estimates of epistemic uncertainty for backbone neural operators.
