Table of Contents
Fetching ...

Smoothness Errors in Dynamics Models and How to Avoid Them

Edward Berman, Luisa Li, Jung Yeon Park, Robin Walters

TL;DR

This work analyzes smoothness errors in dynamics models that use graph and mesh neural networks, showing that strictly unitary convolutions can be overly constraining for diffusion-like PDE tasks. It derives a theoretical lower bound on unitary approximation error and introduces relaxed unitary convolutions, including Taylor truncation and encoder-decoder strategies, to balance smoothness preservation with expressive power. The framework is extended from graphs to meshes via mesh Rayleigh quotients and unitary mesh convolutions, enabling PDE surrogates on complex geometries. Empirical results on heat and wave PDEs on PyVista meshes and WeatherBench WB2 demonstrate that relaxed-unitary models (R-UniGraph, R-UniMesh) achieve state-of-the-art or competitive performance, particularly excelling in diffusive dynamics by aligning the learned smoothness with the true system’s smoothness. The findings highlight the importance of controllable smoothness in physics-guided learning and offer practical tools for accurate, mesh-aware PDE modeling.

Abstract

Modern neural networks have shown promise for solving partial differential equations over surfaces, often by discretizing the surface as a mesh and learning with a mesh-aware graph neural network. However, graph neural networks suffer from oversmoothing, where a node's features become increasingly similar to those of its neighbors. Unitary graph convolutions, which are mathematically constrained to preserve smoothness, have been proposed to address this issue. Despite this, in many physical systems, such as diffusion processes, smoothness naturally increases and unitarity may be overconstraining. In this paper, we systematically study the smoothing effects of different GNNs for dynamics modeling and prove that unitary convolutions hurt performance for such tasks. We propose relaxed unitary convolutions that balance smoothness preservation with the natural smoothing required for physical systems. We also generalize unitary and relaxed unitary convolutions from graphs to meshes. In experiments on PDEs such as the heat and wave equations over complex meshes and on weather forecasting, we find that our method outperforms several strong baselines, including mesh-aware transformers and equivariant neural networks.

Smoothness Errors in Dynamics Models and How to Avoid Them

TL;DR

This work analyzes smoothness errors in dynamics models that use graph and mesh neural networks, showing that strictly unitary convolutions can be overly constraining for diffusion-like PDE tasks. It derives a theoretical lower bound on unitary approximation error and introduces relaxed unitary convolutions, including Taylor truncation and encoder-decoder strategies, to balance smoothness preservation with expressive power. The framework is extended from graphs to meshes via mesh Rayleigh quotients and unitary mesh convolutions, enabling PDE surrogates on complex geometries. Empirical results on heat and wave PDEs on PyVista meshes and WeatherBench WB2 demonstrate that relaxed-unitary models (R-UniGraph, R-UniMesh) achieve state-of-the-art or competitive performance, particularly excelling in diffusive dynamics by aligning the learned smoothness with the true system’s smoothness. The findings highlight the importance of controllable smoothness in physics-guided learning and offer practical tools for accurate, mesh-aware PDE modeling.

Abstract

Modern neural networks have shown promise for solving partial differential equations over surfaces, often by discretizing the surface as a mesh and learning with a mesh-aware graph neural network. However, graph neural networks suffer from oversmoothing, where a node's features become increasingly similar to those of its neighbors. Unitary graph convolutions, which are mathematically constrained to preserve smoothness, have been proposed to address this issue. Despite this, in many physical systems, such as diffusion processes, smoothness naturally increases and unitarity may be overconstraining. In this paper, we systematically study the smoothing effects of different GNNs for dynamics modeling and prove that unitary convolutions hurt performance for such tasks. We propose relaxed unitary convolutions that balance smoothness preservation with the natural smoothing required for physical systems. We also generalize unitary and relaxed unitary convolutions from graphs to meshes. In experiments on PDEs such as the heat and wave equations over complex meshes and on weather forecasting, we find that our method outperforms several strong baselines, including mesh-aware transformers and equivariant neural networks.
Paper Structure (66 sections, 8 theorems, 51 equations, 11 figures, 8 tables)

This paper contains 66 sections, 8 theorems, 51 equations, 11 figures, 8 tables.

Key Result

Proposition 1

Given an undirected graph $\mathcal{G}$ on $n$ nodes with normalized adjacency matrix $\widetilde{\mathbf{A}} = \mathbf{D}^{-1/2}\mathbf{A}\mathbf{D}^{-1/2}$, the Rayleigh quotient is invariant under normalized unitary or orthogonal graph convolution, i.e. $R_{\mathcal{G}}(\mathbf{X}) = R_{\mathcal{

Figures (11)

  • Figure 1: Qualitative comparison of autoregressive model predictions for the heat equation on the armadillo mesh at timestep $T=190$. Our R-UniMesh model remains faithful to the ground truth during each step of the rollout, whereas the EMAN model over smooths and the Hermes model under smooths. A more complete comparison over several timesteps is in \ref{['sec:diagnostics']}, \ref{['tab:diagnostic']}.
  • Figure 2: Top: After zero padding, individual unitary blocks are stacked and the output is fed into an unconstrained decoder. Bottom: Each block uses Taylor truncated unitary convolution.
  • Figure 3: Left: The dotted lines indicate the mean Rayleigh quotient for input heat graphs at $T=3$ and target graphs at $T=4$. We also show the mean Rayleigh quotient for the best performing GCN, R-UniGraph, and Lie unitary models. R-UniGraph is best at capturing the true smoothness. Right: Validation MSE of the same three models. R-UniGraph has the best performance. Results for the full set of runs are provided in \ref{['sec:ensemble']}.
  • Figure 4: The Rayleigh quotient for each timestep on an unseen mesh for Hermes, EMAN, and R-UniMesh models. The R-UniMesh is the best at capturing the true smoothness for heat.
  • Figure 5: RMSE and ACC as a function of lead time for all models temperature prediction. R-UniMesh has a competitive RMSE, especially at early lead time. R-UniMesh also maintains viability for lead times of roughly $2$ days according to the ECMWF baseline. Exact values recorded in \ref{['tab:weather_results']} (\ref{['sec:wb_geo_extended_results']}).
  • ...and 6 more figures

Theorems & Definitions (21)

  • Definition 1: Rayleigh quotient, chung1997spectral
  • Proposition 1: Invariance of Rayleigh quotient, Proposition 6 in kiani2024unitary
  • Proposition 2: Theorem 4.8 in wang2023general
  • Theorem 1
  • Definition 2: Intrinsic Delaunay Criterion, bobenko2007discrete
  • Definition 3: Mesh Rayleigh Quotient
  • Corollary 1: Corollary to \ref{['prop:rq_preserved']}
  • Proposition 3: Proposition 7 in kiani2024unitary
  • Definition 4
  • Definition 5
  • ...and 11 more