Beyond Loss Guidance: Using PDE Residuals as Spectral Attention in Diffusion Neural Operators

Medha Sawhney; Abhilash Neog; Mridul Khurana; Anuj Karpatne

Beyond Loss Guidance: Using PDE Residuals as Spectral Attention in Diffusion Neural Operators

Medha Sawhney, Abhilash Neog, Mridul Khurana, Anuj Karpatne

TL;DR

PRISMA tackles slow and unstable inference in diffusion-based PDE solvers by embedding PDE residuals directly into the model architecture. It introduces Spectral Residual Attention (SRA) within a conditional U-shaped diffusion operator to provide physics-guided attention in the spectral domain, enabling gradient-descent free inference for both forward and inverse PDE problems. The approach yields 15x–250x faster inference with competitive or superior accuracy, especially under noisy observations, and remains robust across full, sparse, and noisy data configurations. This unified, residual-informed diffusion framework has practical impact for fast, reliable PDE solving in scientific computing with imperfect data.

Abstract

Diffusion-based solvers for partial differential equations (PDEs) are often bottle-necked by slow gradient-based test-time optimization routines that use PDE residuals for loss guidance. They additionally suffer from optimization instabilities and are unable to dynamically adapt their inference scheme in the presence of noisy PDE residuals. To address these limitations, we introduce PRISMA (PDE Residual Informed Spectral Modulation with Attention), a conditional diffusion neural operator that embeds PDE residuals directly into the model's architecture via attention mechanisms in the spectral domain, enabling gradient-descent free inference. In contrast to previous methods that use PDE loss solely as external optimization targets, PRISMA integrates PDE residuals as integral architectural features, making it inherently fast, robust, accurate, and free from sensitive hyperparameter tuning. We show that PRISMA has competitive accuracy, at substantially lower inference costs, compared to previous methods across five benchmark PDEs, especially with noisy observations, while using 10x to 100x fewer denoising steps, leading to 15x to 250x faster inference.

Beyond Loss Guidance: Using PDE Residuals as Spectral Attention in Diffusion Neural Operators

TL;DR

Abstract

Beyond Loss Guidance: Using PDE Residuals as Spectral Attention in Diffusion Neural Operators

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (17)