Clifford-Steerable Convolutional Neural Networks

Maksim Zhdanov; David Ruhe; Maurice Weiler; Ana Lucic; Johannes Brandstetter; Patrick Forré

Clifford-Steerable Convolutional Neural Networks

Maksim Zhdanov, David Ruhe, Maurice Weiler, Ana Lucic, Johannes Brandstetter, Patrick Forré

TL;DR

Clifford-Steerable CNNs address the challenge of constructing equivariant neural networks for fields on pseudo-Euclidean spaces by combining Clifford algebra with implicit, equivariant kernel networks. The main idea is to implement $ ext{O}(p,q)$-steerable kernels through a kernel network $oldsymbol{ m \mathscr{K}}$ and a kernel head $H$ so that the resulting kernel $K=H\circ\boldsymbol{ m \mathscr{K}}$ satisfies the steerability constraint and yields an $ ext{E}(p,q)$-equivariant convolution. The paper develops the theoretical basis (Clifford grades as $ ext{O}(p,q)$-representations, Clifford-group equivariant nets) and demonstrates empirical gains on physics-informed tasks, including fluid dynamics and relativistic electrodynamics, while enabling extension to curved pseudo-Riemannian manifolds. This work provides a unified, scalable framework for processing multivector fields with full spacetime symmetries, offering improved accuracy and sample efficiency over conventional steerable CNNs and non-equivariant Clifford models. The approach has broad implications for PDE surrogates and physics-embedded learning where $ ext{E}(p,q)$-invariance plays a central role.

Abstract

We present Clifford-Steerable Convolutional Neural Networks (CS-CNNs), a novel class of $\mathrm{E}(p, q)$-equivariant CNNs. CS-CNNs process multivector fields on pseudo-Euclidean spaces $\mathbb{R}^{p,q}$. They cover, for instance, $\mathrm{E}(3)$-equivariance on $\mathbb{R}^3$ and Poincaré-equivariance on Minkowski spacetime $\mathbb{R}^{1,3}$. Our approach is based on an implicit parametrization of $\mathrm{O}(p,q)$-steerable kernels via Clifford group equivariant neural networks. We significantly and consistently outperform baseline methods on fluid dynamics as well as relativistic electrodynamics forecasting tasks.

Clifford-Steerable Convolutional Neural Networks

TL;DR

-steerable kernels through a kernel network

and a kernel head

so that the resulting kernel

satisfies the steerability constraint and yields an

-equivariant convolution. The paper develops the theoretical basis (Clifford grades as

-representations, Clifford-group equivariant nets) and demonstrates empirical gains on physics-informed tasks, including fluid dynamics and relativistic electrodynamics, while enabling extension to curved pseudo-Riemannian manifolds. This work provides a unified, scalable framework for processing multivector fields with full spacetime symmetries, offering improved accuracy and sample efficiency over conventional steerable CNNs and non-equivariant Clifford models. The approach has broad implications for PDE surrogates and physics-embedded learning where

-invariance plays a central role.

Abstract

We present Clifford-Steerable Convolutional Neural Networks (CS-CNNs), a novel class of

-equivariant CNNs. CS-CNNs process multivector fields on pseudo-Euclidean spaces

. They cover, for instance,

-equivariance on

and Poincaré-equivariance on Minkowski spacetime

. Our approach is based on an implicit parametrization of

-steerable kernels via Clifford group equivariant neural networks. We significantly and consistently outperform baseline methods on fluid dynamics as well as relativistic electrodynamics forecasting tasks.

Paper Structure (44 sections, 10 theorems, 142 equations, 11 figures, 1 table, 3 algorithms)

This paper contains 44 sections, 10 theorems, 142 equations, 11 figures, 1 table, 3 algorithms.

Introduction
Theoretical Background
Pseudo-Euclidean spaces and groups
Pseudo-Euclidean vector spaces
Pseudo-Euclidean groups
Feature vector fields & Steerable CNNs
Feature vector fields
Steerable CNNs
The Clifford Algebra & Clifford Group Equivariant Neural Networks
Introduction to the Clifford algebra
Clifford grades as $\operatorname{O}\mkern-.5mu(p,\mkern-2muq\mkern-.5mu)$-representations
$\operatorname{O}\mkern-.5mu(p,\mkern-2muq\mkern-.5mu)$-equivariant Clifford Neural Nets
Clifford-Steerable CNNs
Experimental Results
Conclusions
...and 29 more sections

Key Result

Theorem 2.12

Consider a layer ${L: \Gamma(\mathbb{R}^{p,q},W_\textup{in}) \to \Gamma(\mathbb{R}^{p,q},W_\textup{out})}$ mapping between feature fields of types $(W_\textup{in},\rho_\textup{in})$ and $(W_\textup{out},\rho_\textup{out})$, respectively. If $L$ is demanded to be linear and $\mathrm{Aff}(G)$-equivari

Figures (11)

Figure 1: CS-CNNs process multivector fields while respecting $\operatorname{E}(p,q)$-equivariance. Shown here is a Lorentz-boost $\operatorname{O}(1,1)$ of electromagnetic data on 1+1-dimensional spacetime $\mathbb{R}^{1,1}$.
Figure 2: CS-CNNs process multivector fields while respecting $\operatorname{E}(p,q)$-equivariance. Shown here is a Lorentz-boost $\operatorname{O}(1,1)$ of electromagnetic data on 1+1-dimensional spacetime $\mathbb{R}^{1,1}$.
Figure 3: Examples of pseudo-Euclidean spaces $\mathbb{R}^{2,0}$ and $\mathbb{R}^{1,1}$. Colors depict $\operatorname{O}(p,q)$-orbits, given by sets of all points $v\in\mathbb{R}^{p,q}$ with the same squared distance$\eta^{p,q}(v,v)$ from the origin.
Figure 4: Left: Multi-vector valued output of the kernel-network $\mathscr{K}$ for ${{c_\mathrm{in}} \mkern-2mu = \mkern-2mu {c_\mathrm{out}} \mkern-2mu = \mkern-2mu 1, (p,\mkern-2muq) \mkern-2mu = \mkern-2mu (1,\mkern-2mu1)}$, and its expansion to a full ${\operatorname{O}(1,\mkern-3mu1)}$-steerable kernel via the kernel head $H$. Right: Commutative diagram of the construction and ${\operatorname{O}(p,\mkern-2muq)}$-equivariance of implicit steerable kernels ${K\mkern-3mu=\mkern-2muH \mkern-2mu\circ\mkern-2mu \mathscr{K}}$, composed from a kernel network $\mathscr{K}$ with ${{c_\mathrm{out}}\!\times\!{c_\mathrm{in}}}$ multivector outputs and the kernel head $H$. The two inner squares show the individual equivariance of $\mathscr{K}$ and $H$, from which the kernels' overall equivariance follows. We abbreviate $\operatorname{Cl}(\mathbb{R}^{p,q})$ by $\operatorname{Cl}$.
Figure 5: Plots 1 & 2: Mean squared errors (MSEs) on the Navier-Stokes 2D and Maxwell 3D forecasting tasks (one-step loss) as a function of number of training simulations. Plot 3: Convergence (test loss) of our model vs. a basic ResNet on the relativistic Maxwell task. Plot 4: Relative $\operatorname{O}(2)$-equivariance errors of different models. $G$-FNOs fail as they cannot correctly ingest multivector data.
...and 6 more figures

Theorems & Definitions (63)

Definition 2.1: Pseudo-Euclidean vector space
Definition 2.2: Standard pseudo-Euclidean vector spaces
Example 2.3
Definition 2.4: Translation groups
Definition 2.5: Pseudo-orthogonal groups
Example 2.6
Definition 2.7: Pseudo-Euclidean groups
Example 2.8
Definition 2.9: Feature vector field
Remark 2.10
...and 53 more

Clifford-Steerable Convolutional Neural Networks

TL;DR

Abstract

Clifford-Steerable Convolutional Neural Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (63)